Sequences and Limits II


The Open University 


Mathematics Foundation Course Unit 14 
SEQUENCES AND LIMITS II 


Prepared by the Mathematics Foundation Course Team 


Correspondence Text 14 


The Open University Press


Open University courses provide a method of study for independent 
learners through an integrated teaching system including textual material, 
radio and television programmes and short residential courses. This text 
is one of a series that make up the correspondence element of the Mathe- 
matics Foundation Course. 


The Open University’s courses represent a new system of university 
level education. Much of the teaching material is still in a developmental 
stage. Courses and course materials are, therefore, kept continually under 
revision. It is intended to issue regular up-dating notes as and when the 
need arises, and new editions will be brought out when necessary. 


Further information on Open University courses may be obtained from 
The Admissions Office, The Open University, P.O. Box 48, Bletchley, 
Buckinghamshire. 


The Open University Press 
Walton Hall, Bletchley, Bucks 


First Published 1971 
Copyright © 1971 The Open University 


All rights reserved 

No part of this work may be 
reproduced in any form, by 
mimeograph or by any other means, 
without permission in writing from 
the publishers 


Printed in Great Britain by 
J W Arrowsmith Ltd, Bristol 3 


SBN 335 01013 X 


Contents

Objectives
Structural Diagram
Glossary
Notation
Bibliography
Introduction

14.1 Taylor's Expansion
14.1.0 Introduction
14.1.1 The Tangent Approximation
14.1.2 Convergence of an Iterative Method
14.1.3 The Newton-Raphson Process
14.1.4 The Quadratic Taylor Approximation
14.1.5 The General Taylor Approximation

14.2 Infinite Series
14.2.1 Taylor's Theorem
14.2.2 The General Taylor Theorem
14.2.3 Convergence of an Approximation Sequence
14.2.4 Infinite Series
14.2.5 Appendix


FM 14.0 




Objectives 


After working through this unit you should be able to: 
(i) write down the tangent approximation to the image value of a
suitable function;
(ii) explain how the tangent approximation can be used to obtain a
criterion for the convergence of the iterative method for solving
an equation of the form x = F(x);
(iii) explain and apply the Newton-Raphson process for the solution
of an equation of the form f(x) = 0;
(iv) write down the Taylor and Maclaurin approximation formulas;
(v) calculate the first few terms (and the general term in simple cases)
in the Taylor expansion of a given function about a given point in
the domain of the function;
(vi) write down the statement of Taylor's Theorem, and use the theorem
to estimate the error in a given Taylor expansion;
(vii) write down the definitions of the terms:
infinite series
partial sum of an infinite series
convergent infinite series
sum of a convergent infinite series;
(viii) decide whether a given simple infinite series is convergent or
divergent.


Note 


Before working through this correspondence text, make sure you have 
read the general introduction to the mathematics course in the Study 
Guide, as this explains the philosophy underlying the whole course. You 
should also be familiar with the section which explains how a text is 
constructed and the meanings attached to the stars and other symbols 
in the margin, as this will help you to find your way through the text. 


Structural Diagram 


[Structural diagram. Boxes shown: Geometric Progression (R.B.8);
Binomial Expansion (R.B.9); Sequences and Limits; Differentiation
(Unit 12); Errors and Accuracy (Unit 2); Taylor's Expansion (14.1);
Convergence of the Iterative Method (14.1.2); Newton-Raphson Process
(14.1.3); Taylor's Theorem (14.2.1 to 14.2.3); Infinite Series (14.2.4);
Appendix (14.2.5).]


Glossary 


Terms which are defined in this glossary are printed in CAPITALS. 


CONVERGENT SERIES
A CONVERGENT SERIES is an INFINITE SERIES of which the PARTIAL SUMS
form a convergent sequence.

CORRECTION
A CORRECTION is the term which must be added to an approximation to
cancel the error and thus yield the exact value of the required
quantity, that is,
(correction) = (exact value) - (approximation).

DIVERGENT SERIES
A DIVERGENT SERIES is an INFINITE SERIES of which the PARTIAL SUMS
form a divergent sequence.

GEOMETRIC SERIES
An INFINITE GEOMETRIC SERIES is an INFINITE SERIES of the form

$$a + ar + ar^2 + ar^3 + \cdots$$

A FINITE GEOMETRIC SERIES has the form

$$a + ar + ar^2 + \cdots + ar^n.$$

INFINITE SERIES
An INFINITE SERIES is an expression of the form

$$a_1 + a_2 + a_3 + \cdots$$

where $a_1, a_2, a_3, \ldots$ is an infinite sequence.

MACLAURIN APPROXIMATION OF DEGREE n
The MACLAURIN APPROXIMATION OF DEGREE n is the TAYLOR APPROXIMATION
OF DEGREE n about 0.

MACLAURIN'S EXPANSION
MACLAURIN'S EXPANSION is TAYLOR'S EXPANSION about 0.

NEWTON-RAPHSON PROCESS
The NEWTON-RAPHSON PROCESS is a successive approximation method for
solving an equation of the form

$$f(x) = 0$$

based on the recurrence formula:

$$u_{k+1} = u_k - \frac{f(u_k)}{f'(u_k)}.$$

PARTIAL SUM
The nth PARTIAL SUM of an INFINITE SERIES is the sum of the first n
terms of the series.

QUADRATIC TAYLOR APPROXIMATION
The QUADRATIC TAYLOR APPROXIMATION is the TAYLOR APPROXIMATION of
degree 2.

SNOWFLAKE CURVE
The SNOWFLAKE CURVE is the curve described on page 39.

SUM
The SUM of a CONVERGENT SERIES is the limit of the sequence of PARTIAL
SUMS of the INFINITE SERIES.

TANGENT APPROXIMATION
The TANGENT APPROXIMATION corresponds to approximating the pictorial
graph of a function by a tangent to the graph at some suitable point.

TAYLOR APPROXIMATION OF DEGREE n
The TAYLOR APPROXIMATION OF DEGREE n about a is the approximation to
a function f by the polynomial of degree n:

$$x \longmapsto f(a) + f'(a)(x - a) + \cdots + \frac{f^{(n)}(a)}{n!}(x - a)^n.$$

TAYLOR'S EXPANSION
TAYLOR'S EXPANSION is a method of obtaining successive polynomial
approximations to certain images under a function, using the images
under the function and its derived functions of a single point in the
domain.

TAYLOR'S THEOREM
TAYLOR'S THEOREM is a theorem which gives a formula for the CORRECTION
to the general TAYLOR APPROXIMATION.


Notation 
The symbols are presented in the order in which they appear in the text. 


$\exp$          The exponential function.
$\simeq$        "is approximately equal to".
$f'(a)$         The derivative of f at a.
$f''(a)$        The derivative of f' at a.
$e$             $e = \exp(1) = 2.71828\ldots$
$\Delta_h$      The finite difference operator.
$\Delta_h^2$    $\Delta_h \circ \Delta_h$.
$D$             The differentiation operator.
$D^n$           $D \circ D \circ \cdots \circ D$ (n times).
$f^{(n)}$       The nth derived function of f, $D^n f$.
$k!$            Factorial k; that is, $1 \times 2 \times 3 \times \cdots \times k$, where $k \in Z^+$.
$C_1(x)$        The correction to the tangent approximation for f(x) about
                some given point.
$\wedge$        The symbol for conjunction ("and").
$\vee$          The symbol for alternation ("or").
$\Rightarrow$   The symbol for implication ("implies").
$\Leftrightarrow$  The symbol for equivalence ("implies and is implied by").
$C_n(x)$        The correction to the Taylor approximation of degree n for
                f(x) about some given point.
$S_k$           The kth partial sum (the sum of the first k terms) of an infinite
                series.
$\int_a^b f$    The definite integral of f in [a, b].
$[F]_a^b$       $F(b) - F(a)$.
$\ln$           The logarithm function.


Bibliography 


T. M. Apostol, Calculus Vol. I (Blaisdell, 1967). 

Apostol discusses the fitting of the Taylor polynomial on pages 272 to 283. 
He derives a slightly different formula for the correction, $C_n(x)$ (in our
notation); he calls this correction term "the remainder" and denotes
it by R(x). The definition of the sum of a convergent infinite series is
given on page 384.


R. Courant, Differential and Integral Calculus Vol. I (Blackie, 1966). 
Courant discusses the fitting of the Taylor polynomial on pages 315 to 
329, covering roughly the same ground as Apostol, but in more detail 
and at a more leisurely pace. The sum of a convergent infinite series is
discussed briefly on pages 366 to 368. 


Like most books on calculus, both these books discuss various criteria 
for the convergence of infinite series, and they also give many examples 
of the use of Taylor’s and Maclaurin’s series for particular functions, 
together with rules for adding, multiplying, dividing, differentiating and 
integrating infinite series. The subject of infinite series and the representa- 
tion of functions by convergent sequences of polynomial approximations 
is a very big one, of which only a small fraction can be dealt with in our 
Foundation Course. 




14.0 INTRODUCTION 


In this unit we return to a problem which we considered in Unit 4, Finite 
Differences: how to evaluate the images of real functions which cannot 
be expressed in terms of the elementary operations of arithmetic. In 
Unit 4 we based our discussion on the assumption that a table of images 
was available; it was then possible to approximate to the function (over 
a suitable interval of the domain) by a linear or polynomial function 
chosen to fit the function at the tabular points. In many cases, however, 
the tabulated values necessary for this method may not exist; and in 
any case, where do the numbers in the table come from? Even when 
the tables do exist, they may not provide the most suitable source of 
image values for the function. In particular, automatic computers 
cannot afford the space to store tables of functions; it is much more 
economical to store a sub-routine that calculates particular images 
from scratch as they are required (just as you might find it more convenient 
to work out the cost of 75 yards of curtain material at £1.71 a yard with 
pencil and paper rather than hunt around for a ready reckoner). 


The type of approximation method we shall develop here is similar to 
the methods we used in Unit 4, Finite Differences in that it is based on 
polynomial approximation. However, instead of being fitted to the 
images of a function corresponding to equally spaced tabular points, the 
polynomials are fitted by a different method, which can be regarded as 
the limiting form of the finite difference method for very small tabular 
spacing. Instead of requiring a knowledge of the values of the images 
of the function at a set of equally spaced tabular points, the new method 
requires a knowledge of the value of the function at just one point, together 
with its derivative, and perhaps also some of the higher derivatives 
(derivatives of derivatives) at the same point. The virtue of the new method 
is that it may be much easier to calculate a number of higher derivatives 
at a single point in the domain of the function than to calculate images 
of the function at a number of different points. In fact, for this reason, the 
new method plays an important part in the calculation of the tabular 
values which are used in the method of approximation described in 
Unit 4. 


We have already mentioned the use of this new method in computing
images of the function; it has many other uses too. It is used, for example,
in studying the effectiveness of numerical approximation methods, such
as the iterative methods for solving equations of the form x = F(x) which
were described in Unit 2, Errors and Accuracy, and investigated further
in Unit 7, Sequences and Limits I. In Unit 2 we discussed a way of
discovering in advance whether the recurrence formula $u_k = F(u_{k-1})$,
defining the iteration sequence, would or would not give a convergent
sequence of approximations to the solution of the equation, using the idea
of a scale factor. But this method is not always simple in practice, and to
simplify the discussion of the convergence of the iteration it is useful to
have a simple approximation to F(u) when u is close to the supposed limit
of the iterative sequence: that is, to the solution of the equation x = F(x).
The method we shall describe in this unit provides such an approximation,
and leads to a simple criterion for picking out the cases for which
the iterative method converges.


In addition to these applications to computing, the method we shall
describe is valuable as a method of defining functions. For example, the
definition of the exponential function given in Unit 7, Sequences and
Limits I:

$$\exp x = \lim_{k \text{ large}} \left(1 + \frac{x}{k}\right)^k \qquad (x \in R)$$

is by no means universal; many people prefer to define the exponential
function in terms of the limit of a sequence of polynomial functions of
the type which we discuss in this correspondence text (though, of course, 
all the definitions define the same function). For the exponential function 
this may not seem particularly useful, since the definition already given 
is perfectly adequate; but mathematicians meet many functions which 
are more complicated than the exponential function, and for these the 
definition as the limit of a sequence of polynomials is often the most 
convenient. In fact this new method of defining functions was the starting 
point of developments which enormously expanded the scope of mathe- 
matics during the eighteenth century. Some of these developments will 
be mentioned in the radio component of this unit. 
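
As a rough numerical illustration of the Unit 7 definition quoted above, the following Python sketch evaluates $(1 + x/k)^k$ for increasing k and compares it with the library value of exp. It assumes that the definition reads as the limit of $(1 + x/k)^k$; the function name `exp_limit` is ours and is purely illustrative.

```python
import math


def exp_limit(x: float, k: int) -> float:
    """Return (1 + x/k)**k, the k-th term of the sequence whose limit is exp x."""
    return (1.0 + x / k) ** k


# The terms creep towards exp(1) = 2.71828... quite slowly as k grows.
for k in (1, 10, 100, 10_000, 1_000_000):
    print(k, exp_limit(1.0, k))
print("math.exp(1) =", math.exp(1.0))
```

The slow approach to the limit is one practical reason for wanting other ways of computing such functions.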


14.1 TAYLOR’S EXPANSION 


14.1.0 Introduction 


Taylor’s expansion is a method of obtaining successive approximations 
to the images of suitable elements in the domains of certain functions. 


A typical case is the evaluation of $\sin(18^\circ) = \sin(\pi/10)$, which is treated
in the television component of this unit. The first method tried out
there is to fit polynomials to known points on the graph of the sine
function. Both linear and quadratic approximations are tried, but neither
gives better than a 2% approximation to the true value, 0.3090 (to four
decimal places).


The reason for this low accuracy is essentially the large tabular spacing 
used. We could try to improve the accuracy by using interpolation 
polynomials of higher degree, or by using a smaller tabular interval, say 


= or even =; instead of = But both methods are cumbersome, the first 


because one has to use polynomials of very high degree to get high 
accuracy in the sines, and the second because the trigonometric formulas 


= ae Segoe ; 
from which we can calculate sin 1 sin ov and so on are rather complicated. 


A further disadvantage of this method is that it lacks generality: we 
happen to know some special images under the sine function, but for 
other functions, such as the exponential function or the logarithm func- 
tion, there is no such ready-made source of image values from which to 
make a table. It would be very convenient to have a method that did 
not require at the outset a set of tabular values of the function. 


The polynomial approximation method which we shall discuss in this 
text provides such a method; the polynomials are fitted using the images 
under the function and its various derived functions (first, second, third, 
etc.), of a single point in the domain, instead of images under the function 
alone of several different points in the domain. The method is called 
Taylor’s expansion. The special case of Taylor’s expansion where the 
chosen element in the domain is the number 0 is called Maclaurin’s 
expansion; this special case is the one treated in the television component 
of this unit. 


[Portrait: Brook Taylor, 1685-1731 (University College Library)]

[Portrait: Colin Maclaurin, 1698-1746 (Mansell Collection)]


14.1.1 The Tangent Approximation


Just as in the theory of finite difference methods, where the simplest
methods of interpolation and extrapolation are based on linear
approximations, so the simplest form of Taylor's expansion is a linear
approximation. One way of arriving at this approximation is to consider a
straight line chosen to pass through two points very close together on the
graph of the function. The diagrams from the television programme
illustrate how, in the limit as the tabular spacing h approaches zero, the
straight line approaches a tangent, which, in this particular case, is the
tangent at the point where the curve passes through the origin.


To calculate $\sin(\pi/10)$ by this method, we find the equation of the tangent
to the curve at the nearest convenient point, which in this case is the
origin.


If we assume that the equation of the tangent is $y = b_0 + b_1 x$, then,
since the tangent passes through the origin, we find $b_0 = 0$. Further,
the derived function of sin is cos, and cos 0 = 1, so that the slope of the
tangent, $b_1$, is 1. Hence the equation of the tangent at the origin is

$$y = x$$

and our first (tangent) approximation to sin x is

$$\sin x \simeq x.$$

Thus

$$\sin\left(\frac{\pi}{10}\right) \simeq \frac{\pi}{10} \simeq 0.3142.$$

(A method for calculating $\pi$ is given in Unit 13, Integration II.)


[Figure: graph of $x \longmapsto \sin x$.]


The same method can be applied to any real function f. We start by
taking any two points on the curve, say (a, f(a)) and (a + h, f(a + h)),
and obtain the equation of the line which gives the linear approximation
for tabular spacing h.


The slope of the line joining the two points is $\frac{f(a + h) - f(a)}{h}$, and since
the line passes through (a, f(a)) its equation is

$$y = f(a) + \frac{f(a + h) - f(a)}{h}(x - a).$$

We want the limiting form of this equation as the right-hand point
approaches the left-hand one, that is, as h approaches zero. In this limit,
the line still passes through the point (a, f(a)), but its slope is now the
derivative at this point, namely f'(a) (see section 12.1.4 of Unit 12,
Differentiation I).


[Figure: the graph of f with its tangent at (a, f(a)); the slope of the tangent is f'(a).]


Accordingly, the equation of the tangent at (a, f(a)) is

$$y = f(a) + f'(a)(x - a). \qquad \text{Equation (1)}$$

The tangent approximation is obtained by taking the right-hand side of
Equation (1) as an approximation to f(x); that is,

$$f(x) \simeq f(a) + f'(a)(x - a). \qquad \text{Definition 1}$$
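
The tangent approximation can be tried out numerically. The sketch below is only an illustration: the helper name `tangent_approx` is ours, and the derived function is supplied by hand rather than computed. It rebuilds the approximation sin x ≃ x about a = 0 and applies it at x = π/10.

```python
import math
from typing import Callable


def tangent_approx(f: Callable[[float], float],
                   df: Callable[[float], float],
                   a: float) -> Callable[[float], float]:
    """Return the function x -> f(a) + f'(a)(x - a), the tangent approximation about a."""
    fa, slope = f(a), df(a)

    def approx(x: float) -> float:
        return fa + slope * (x - a)

    return approx


# Tangent to the sine curve at the origin: sin' = cos, and cos 0 = 1.
sin_near_0 = tangent_approx(math.sin, math.cos, 0.0)
x = math.pi / 10
estimate = sin_near_0(x)
print("tangent estimate:", estimate)
print("library value:   ", math.sin(x))
print("error:           ", estimate - math.sin(x))
```

Passing a different value of a gives the tangent approximation about another point of the domain.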


Exercise 1 (2 minutes)


Find the equation of the tangent to the sine curve at the point
$\left(\frac{\pi}{6}, \sin\frac{\pi}{6}\right)$, and use this as an approximation to estimate $\sin\left(\frac{\pi}{10}\right)$. (You
may assume that $\sin\left(\frac{\pi}{6}\right) = 0.5000$ and $\cos\left(\frac{\pi}{6}\right) = 0.8660$.)


Exercise 2 (4 minutes)


Would you expect the tangent approximation to give the greatest accuracy
for large or small values of |x - a|? To test your answer, choose a value
of a, and then calculate exp x from a tangent approximation for each of
the values of x tabulated below; tabulate the error

(tangent approximation to exp x) - exp x,

and note how it varies with x for a fixed value of a.

x      exp x
0.8    2.23
0.9    2.46
1.0    2.72
1.1    3.00
1.2    3.32


Exercise 3 (4 minutes)


When a solid is heated it expands. The volume coefficient of thermal
expansion of a solid may be defined as

(increase in volume due to a temperature increase of one degree)
/ (original volume)

and the linear coefficient of expansion may be defined as

(increase in any linear dimension due to a temperature increase of one degree)
/ (original linear dimension).

For copper, the volume coefficient of expansion is about $50 \times 10^{-6}$ per
degree Centigrade, and the linear coefficient is $16 \times 10^{-6}$ per degree
Centigrade, which is about one third of the volume coefficient. Is this
simple relation between the coefficients merely a coincidence?


Solution 1


From Equation (1), the equation of the tangent is

$$y = \sin\frac{\pi}{6} + \cos\frac{\pi}{6}\left(x - \frac{\pi}{6}\right),$$

since sin' = cos. The approximation to $\sin\frac{\pi}{10}$ is therefore

$$\sin\frac{\pi}{6} + \cos\frac{\pi}{6}\left(\frac{\pi}{10} - \frac{\pi}{6}\right)
= 0.5000 + 0.8660(0.3142 - 0.5236)
= 0.3187.$$

(This approximation is about 3% larger than the correct value, 0.3090.)


Solution 2


[Figure: the graph of $x \longmapsto f(x)$ and its tangent at a; the error is small
for x near a and large for x far from a.]


From diagrams such as the above, we expect the approximation to
improve as x approaches a. That is to say, the smaller |x - a| is, the smaller
will be the error in the approximation.

Choosing a = 1 (only because this value appears in the middle of the
table), we obtain

$$\exp x \simeq \exp 1 + \exp'(1)(x - 1) = 2.72 + 2.72(x - 1)$$

since

$$\exp'(1) = \exp(1) = 2.72.$$

The approximations and their errors are given in the following table:

x      exp x    x - a    Tangent approxn.    Error
0.8    2.23     -0.2     2.18                -0.05
0.9    2.46     -0.1     2.45                -0.01
1.0    2.72      0       2.72                 0
1.1    3.00      0.1     2.99                -0.01
1.2    3.32      0.2     3.26                -0.06


If you chose a different value for a, you should still have found the error
larger, the larger the value of |x - a|. (In fact the error is roughly
proportional to $(x - a)^2$.) Of course, in special cases the above argument may
not hold; for instance, if the curve "bends back" towards the tangent or
intersects the tangent somewhere close to a. But, in general, "near to" a
the above argument is sound, although intuitive.


[Figure: a graph of f(x) that crosses its tangent again, giving no error at the
point of intersection.]
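
The remark that the error is roughly proportional to $(x - a)^2$ can be checked numerically. The following Python sketch is an illustration only; it uses full-precision library values of exp rather than the two-decimal table values, so its errors differ slightly from those tabulated. The names `tangent_exp` and `rows` are ours.

```python
import math

A = 1.0  # point of contact, as chosen in the solution


def tangent_exp(x: float, a: float = A) -> float:
    """Tangent approximation to exp x about a: exp(a) + exp'(a)(x - a), with exp' = exp."""
    return math.exp(a) + math.exp(a) * (x - a)


rows = []
for x in (0.8, 0.9, 1.0, 1.1, 1.2):
    error = tangent_exp(x) - math.exp(x)
    # Ratio error / (x - a)^2; undefined at x = a, where the error is zero.
    ratio = error / (x - A) ** 2 if x != A else None
    rows.append((x, error, ratio))
    print(x, round(error, 4), ratio)
```

The ratio in the last column stays of the same order for every tabulated x, which is what "roughly proportional" means here.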


Solution 3


No. Suppose we have a cube of any solid and we increase the temperature
of the cube by 1 degree. If the length of the side of the cube is L, then the
new length will be L(1 + x), where x is the linear coefficient of expansion.
Thus the new volume is $L^3(1 + x)^3$.


Thus the volume coefficient of expansion is

$$\frac{L^3(1 + x)^3 - L^3}{L^3} = (1 + x)^3 - 1.$$

Now if

$$f : x \longmapsto (1 + x)^3 - 1 \qquad (x \in R),$$

then

$$f' : x \longmapsto 3(1 + x)^2 \qquad (x \in R).$$

So the tangent approximation to $(1 + x)^3 - 1$, using the tangent at
x = 0, is

$$(1 + x)^3 - 1 \simeq f(0) + f'(0)(x - 0) = 3x.$$

Thus the volume coefficient of expansion is approximately three times
the linear coefficient for any solid, and so the case of copper was not
merely a coincidence.
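
To see how good the approximation 3x is for a coefficient as small as copper's, the short Python sketch below compares $(1 + x)^3 - 1$ with 3x, using the linear coefficient quoted in Exercise 3. The variable names are ours.

```python
LINEAR = 16e-6  # linear coefficient for copper, per degree Centigrade (from Exercise 3)

exact_volume = (1 + LINEAR) ** 3 - 1   # (1 + x)^3 - 1
tangent_volume = 3 * LINEAR            # tangent approximation about x = 0

print("exact:  ", exact_volume)
print("tangent:", tangent_volume)
print("difference:", exact_volume - tangent_volume)
```

The difference comes from the neglected terms $3x^2 + x^3$, which are far smaller than 3x when x is of the order of $10^{-5}$.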


14.1.2 Convergence of an Iterative Method 


Although the tangent approximation is not very accurate, it is simple 
to use, and can be very effective when the accuracy it offers is sufficient 
for the matter in hand. Before proceeding to discuss how we can improve 
its accuracy, we shall consider how it can be used when solving equations. 
In this section we shall use it to obtain a criterion for the convergence 
of the iterative method for solving equations of the form

$$x = F(x) \qquad \text{Equation (1)}$$

such as the "omelette equation"

$$x = \sin x + \tfrac{2}{3}\pi$$

introduced in section 2.4.2 of Unit 2, Errors and Accuracy. The iterative
method for solving such equations was described in section 7.3.4 of
Unit 7, Sequences and Limits I: we construct a sequence $u_1, u_2, u_3, \ldots$
in which the first term is any crude approximation to a solution of
Equation (1), and the later terms are calculated using the recurrence formula:

$$u_k = F(u_{k-1}) \qquad (k = 2, 3, 4, \ldots).$$

It was shown in Unit 7 that, if this sequence converges to a limit a and F
is continuous at a, then a is a solution of Equation (1). To avoid wasting
time calculating the elements of non-convergent sequences, it is useful
to have a simple criterion by which we can tell, without actually doing
the calculation, which solutions of Equation (1) (if any) can be found by
this method.

In Unit 2, Errors and Accuracy we discussed one way of finding such
a criterion, but the tangent approximation gives a simpler method. To
start with, we suppose that the sequence $u_1, u_2, \ldots$ does converge, and
that its limit is a. Then, for large k, the numbers $u_k$ are close to a, and so
it is natural to consider using the tangent approximation to simplify
the right-hand side of the recurrence formula

$$u_k = F(u_{k-1}) \qquad (k = 2, 3, \ldots). \qquad \text{Equation (2)}$$

The tangent approximation for $F(u_{k-1})$ that is useful when $u_{k-1}$ is close
to a is

$$F(u_{k-1}) \simeq F(a) + F'(a)(u_{k-1} - a)$$

(see Equation 14.1.1.1).
Substituting this into Equation (2), we obtain

$$u_k \simeq F(a) + F'(a)(u_{k-1} - a).$$

Since a is a solution of the equation x = F(x), we have a = F(a), and
so this last approximation is equivalent to

$$u_k - a \simeq F'(a)(u_{k-1} - a).$$

That is, when k is large, the deviation of the kth term, $u_k$, from the limit, a,
is $u_k - a$, and differs from the preceding deviation, $u_{k-1} - a$, by a factor
F'(a), which is independent of k; that is,

kth deviation ≃ F'(a) × ((k - 1)th deviation).

It follows that, when k is large, the deviations will increase as k increases
if |F'(a)| > 1. But if the sequence $u_1, u_2, \ldots$ converges to a limit a, then
the deviations from a must eventually decrease as we take elements later
and later in the sequence. Thus if the iterative sequence converges to a,
then |F'(a)| < 1.


Notice the if in that last sentence. To complete the criterion it would
be nice to be able to prove the converse statement: "if a = F(a) and
|F'(a)| < 1, then the iterative sequence converges to a". It is not quite
as simple as this, however; for example, there might be two different
numbers $a_1$ and $a_2$, both being solutions of x = F(x) and such that
$|F'(a_1)| < 1$ and $|F'(a_2)| < 1$, but the sequence could not possibly converge
to both of them, since the limit of a convergent sequence is unique. What
we can say is that if a = F(a) and |F'(a)| < 1, and $u_1$ is chosen close enough
to a, then the sequence $u_1, u_2, \ldots$ will converge to a, for then the
deviations $u_1 - a, u_2 - a, \ldots$ approximately form a geometric progression
(see R.B.8) converging to zero. If, however, $u_1$ is chosen so far from a that
the tangent approximation for $F(u_1)$ is very inaccurate, then we have no
reason to expect the sequence to converge to a. It may ultimately converge
to a anyway, but it may converge to some other solution of x = F(x), or it
may not converge at all.
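
The relation between successive deviations can be observed numerically. The Python sketch below is an illustration under our own assumptions: the equation x = cos x is a hypothetical example that does not come from this text, and the names `iterate`, `F` and `dF` are ours. It checks that the ratio of successive deviations is close to F'(a), with |F'(a)| < 1.

```python
import math
from typing import Callable, List


def iterate(F: Callable[[float], float], u1: float, terms: int) -> List[float]:
    """Return the first `terms` elements of the sequence u_k = F(u_{k-1})."""
    us = [u1]
    for _ in range(terms - 1):
        us.append(F(us[-1]))
    return us


def F(x: float) -> float:      # hypothetical example: solve x = cos x
    return math.cos(x)


def dF(x: float) -> float:     # derived function of F, supplied by hand
    return -math.sin(x)


us = iterate(F, 1.0, 100)
a = us[-1]                     # taken as the limit, to working accuracy
ratios = [(us[k + 1] - a) / (us[k] - a) for k in range(5, 10)]

print("limit a     =", a)
print("F'(a)       =", dF(a))
print("dev. ratios =", ratios)
```

The printed ratios settle near F'(a), so the deviations behave approximately like a geometric progression with common ratio F'(a).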


Exercise 1 (2 minutes)


The equation $x = x^2 + \tfrac{1}{2}x$ has two solutions. Without calculating
iterative sequences, predict which of them can be computed using the
iterative method based on the recurrence formula

$$u_k = u_{k-1}^2 + \tfrac{1}{2}u_{k-1}.$$


14.1.3. The Newton—Raphson Process 


As our second application of the tangent approximation in numerical
methods, we shall use it to obtain a method for the numerical solution
of equations which has very good convergence properties. The new
method, known as the Newton-Raphson process, is again an iterative
method, but here the tangent approximation is an integral part of the
method, instead of being used almost as an afterthought to discuss the
convergence.


Since we are not now interested in the iteration $u_k = F(u_{k-1})$, we shall
not write the equation to be solved in the form x = F(x), but in the more
convenient form f(x) = 0. (The previous equation, x = F(x), can be put
into this form by taking f(x) = x - F(x).) To construct the recurrence
formula for the Newton-Raphson iteration, suppose that, after k - 1
steps of the iteration, the latest approximation to the solution of f(x) = 0
is $u_k$; we use the tangent approximation to f near $u_k$ to estimate the value
of x where f(x) = 0, and we take this estimate as our next approximation,
$u_{k+1}$. The calculations are illustrated in the figure:


[Figure: the graph of f and its tangent at $(u_k, f(u_k))$.]


By the tangent approximation formula, Equation 14.1.1.1, the tangent
at $(u_k, f(u_k))$ has the equation:

$$y = f(u_k) + f'(u_k)(x - u_k). \qquad \text{Equation (1)}$$


(The discussion of the Newton-Raphson process continues after Solution 14.1.2.1.)


Solution 14.1.2.1


Solving the quadratic equation directly, we obtain the solutions x = 0
and x = 1/2. Convergence of the iterative sequence depends on the value
of |F'(a)| = |2a + 1/2|. If a = 0, |F'(a)| is 1/2, which is less than 1. So provided
the initial guess is close enough to 0, the iterative method will work. If
a = 1/2, then |F'(a)| > 1, so that we cannot get the solution x = 1/2 by the
given recurrence formula, unless we choose a very lucky starting value.
The following table gives a sample iteration.


        Sequence starting    Sequence starting
        near 0               near 1/2
u_1     0.1                  0.6
u_2     0.06                 0.66
u_3     0.0336               0.7656
u_4     0.0179               0.9689
u_5     0.00928              1.423
u_6     0.00473              2.738
u_7     0.00239              8.863
u_8     0.00120              82.979
        (converging to 0)    (diverging)
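
The sample iteration can be reproduced with a few lines of Python. This is a sketch for checking the table, assuming the recurrence $u_k = u_{k-1}^2 + \tfrac{1}{2}u_{k-1}$; the function names are ours.

```python
from typing import List


def F(u: float) -> float:
    """Right-hand side of the equation x = x^2 + x/2."""
    return u * u + 0.5 * u


def iteration_sequence(u1: float, terms: int) -> List[float]:
    """Return u_1, ..., u_terms for the recurrence u_k = F(u_{k-1})."""
    us = [u1]
    for _ in range(terms - 1):
        us.append(F(us[-1]))
    return us


near_zero = iteration_sequence(0.1, 8)   # start near the solution 0
near_half = iteration_sequence(0.6, 8)   # start near the solution 1/2

for k, (p, q) in enumerate(zip(near_zero, near_half), start=1):
    print(f"u_{k}: {p:.5f}   {q:.4f}")
```

Starting even slightly above 1/2 sends the second sequence away from that solution, as the criterion |F'(a)| > 1 predicts.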


(Discussion of the Newton-Raphson process, continued.)


While it may not be possible to solve the equation f(x) = 0 exactly (that
is why we need numerical methods at all), there is no difficulty in solving
the equation

$$(\text{linear approximation to } f(x)) = 0, \qquad \text{Equation (2)}$$

because it is linear. Using the linear approximation on the right-hand side
of Equation (1) in Equation (2), we obtain

$$f(u_k) + f'(u_k)(x - u_k) = 0$$

and the solution for x is

$$x = u_k - \frac{f(u_k)}{f'(u_k)}.$$

This is the value of x where the tangent approximation to f(x) is 0, and
so we use it as our next approximation to the value of x where f(x) itself
is 0. Accordingly, the recurrence formula for the Newton-Raphson
method is

$$u_{k+1} = u_k - \frac{f(u_k)}{f'(u_k)}. \qquad \text{Definition 1}$$
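
The recurrence formula translates directly into a short program. The Python sketch below is a minimal illustration, not a robust routine: it assumes the derived function is supplied by hand, takes a fixed number of steps, and does not guard against $f'(u_k) = 0$. The test equation $x^3 - 2x - 5 = 0$ is a hypothetical example that does not come from this text.

```python
from typing import Callable, List


def newton_raphson(f: Callable[[float], float],
                   df: Callable[[float], float],
                   u1: float,
                   steps: int) -> List[float]:
    """Return u_1, ..., u_{steps+1} using u_{k+1} = u_k - f(u_k) / f'(u_k)."""
    us = [u1]
    for _ in range(steps):
        u = us[-1]
        us.append(u - f(u) / df(u))
    return us


def f(x: float) -> float:          # hypothetical example equation f(x) = 0
    return x ** 3 - 2 * x - 5


def df(x: float) -> float:         # its derived function
    return 3 * x ** 2 - 2


us = newton_raphson(f, df, 2.0, 5)
for k, u in enumerate(us, start=1):
    print(f"u_{k} = {u:.10f}")
```

In practice one would stop when two successive approximations agree to the accuracy required, rather than after a fixed number of steps.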


Exercise 1 (2 minutes)


Apply the Newton-Raphson method to find a solution of the "omelette
equation",

$$x = \sin x + \tfrac{2}{3}\pi,$$

lying between 2 and 3, and compare the number of steps required to
achieve 3-figure accuracy with the number of steps (13) required in the
method we used in Unit 2.


Exercise 2 (2 minutes)


Write down the Newton-Raphson recurrence formula for the equation
$x^2 - a = 0$. Does it look familiar?


14.1.4 The Quadratic Taylor Approximation


The iterative methods for solving f(x) = 0 based on the tangent
approximation can usually yield any degree of accuracy provided we iterate for
long enough. But they only work if we can calculate f(x) for any x:
they do not give us a method for calculating f(x) itself. We have not yet
found a way of calculating $\sin\frac{\pi}{10}$ to better than about 3% accuracy.


The tangent approximation is very good near the point where the tangent
touches the curve, but the accuracy falls off rapidly away from this point;
$\frac{\pi}{10}$ is too far away from the points of contact in the tangent
approximations which we have tried for $\sin\frac{\pi}{10}$. To improve the accuracy we need
something better than the tangent approximation. As in the case of
interpolation from tabulated values, one way to try to improve the
approximation is to use quadratic, or even higher-degree polynomials,
in place of the linear ones we have been using so far.


In the television component of this unit, we obtain the quadratic
approximation by fitting a quadratic function of the form $x \longmapsto c_0 + c_1 x + c_2 x^2$,
where $c_0$, $c_1$, $c_2$ are numbers, to the given function at equally spaced
points in the domain, and then making the spacing h between these
points extremely small. In the limit as h approaches 0, this quadratic
approximation approaches the quadratic Taylor approximation.


(The main text of section 14.1.4 continues after Solutions 14.1.3.1 and 14.1.3.2.)


Solution 14.1.3.1

The equation is most conveniently put in the form f(x) = 0 by taking

$$f(x) = x - \sin x - \tfrac{2}{3}\pi.$$

Then f'(x) = 1 - cos x, and the Newton-Raphson recurrence formula is:

$$u_k = u_{k-1} - \frac{u_{k-1} - \sin(u_{k-1}) - \tfrac{2}{3}\pi}{1 - \cos(u_{k-1})}.$$

The value of $u_1$ is not all that critical, but it is sensible to try to get it
near to the solution. We have chosen $u_1 = 2$; you may well have chosen
some other value, but you should get the same final result. Working to
3 decimal places, we obtain

u_1 = 2
u_2 = 2.709
u_3 = 2.607
u_4 = 2.605
u_5 = 2.605.

Only four iterations are required to give 2.605, compared with the 13
steps of our earlier method.
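
The iterates can be checked with a few lines of Python. The sketch assumes that the constant in the omelette equation is $\tfrac{2}{3}\pi$, which is the value consistent with the solution 2.605; it carries full precision throughout and rounds only for display, so the printed figures should agree with the hand calculation to about three decimal places.

```python
import math

C = 2 * math.pi / 3  # assumed constant term of the omelette equation


def f(x: float) -> float:
    return x - math.sin(x) - C


def df(x: float) -> float:
    return 1 - math.cos(x)


us = [2.0]                      # u_1 = 2, as in the solution
for _ in range(4):              # four Newton-Raphson steps give u_2, ..., u_5
    u = us[-1]
    us.append(u - f(u) / df(u))

print([round(u, 3) for u in us])
```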
Solution 14.1.3.2


In this case $f(x) = x^2 - a$ and $f'(x) = 2x$. The Newton-Raphson
recurrence formula is:

$$u_k = u_{k-1} - \frac{u_{k-1}^2 - a}{2u_{k-1}}
= \tfrac{1}{2}u_{k-1} + \frac{a}{2u_{k-1}}
= \tfrac{1}{2}\left(u_{k-1} + \frac{a}{u_{k-1}}\right).$$

This is Newton's Formula for calculating the square root of a (see
section 7.1.2 of Unit 7, Sequences and Limits I).


(Main text of section 14.1.4, continued.)


The graph of this limiting quadratic function will touch the graph of
the original function; in the television programme we argue that the
graph of this quadratic function not only has the same slope (first
derivative) but also has the same second derivative as the original
function at the point of contact. Denoting the quadratic function which
we are using to approximate to f by q, and the value of x where the
curves touch by a, the conditions to be satisfied are

equal images for a:             $q(a) = f(a)$
equal slopes at a:              $q'(a) = f'(a)$          Equations (1)
equal second derivatives at a:  $q''(a) = f''(a)$

These three conditions provide just sufficient information to determine
the three coefficients $c_0$, $c_1$, $c_2$ in the expression
$q(x) = c_0 + c_1 x + c_2 x^2$. The neatest way to use these conditions
is to write the quadratic in the alternative form (analogous to the
formula $f(a) + f'(a)(x - a)$ for the tangent approximation):

$$q(x) = b_0 + b_1(x - a) + b_2(x - a)^2. \qquad \text{Equation (2)}$$

Differentiating, we get

$$q'(x) = b_1 + 2b_2(x - a)$$
$$q''(x) = 2b_2$$

so that the values of the quadratic function and its derivatives at a are

$$q(a) = b_0, \qquad q'(a) = b_1, \qquad q''(a) = 2b_2.$$

Comparing with Equations (1) we find that

$$b_0 = f(a), \qquad b_1 = f'(a), \qquad b_2 = \tfrac{1}{2}f''(a)$$

so that, substituting in Equation (2), the quadratic approximation
to f(x) (when x is close to a) is

$$f(x) \approx f(a) + f'(a)(x - a) + \tfrac{1}{2}f''(a)(x - a)^2.$$

It is usually called the quadratic Taylor approximation to f about a.
With this formula we can take any real function f and any element a
in the domain of f, for which f(a) and its first two derivatives are known,
and calculate an approximation to the image value of any other element
x close to a in the domain.
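The formula can be tried out numerically. The sketch below is only an illustration: the function name `quadratic_taylor` and the choice of the square-root function about a = 1 are assumptions made for the example, not taken from the text.

```python
import math

def quadratic_taylor(f_a, d1_a, d2_a, a, x):
    """Quadratic Taylor approximation to f(x) about a, from f(a), f'(a), f''(a)."""
    dx = x - a
    return f_a + d1_a * dx + 0.5 * d2_a * dx ** 2

# Hypothetical example: f(x) = sqrt(x) about a = 1,
# where f(1) = 1, f'(1) = 1/2 and f''(1) = -1/4.
a, x = 1.0, 1.1
tangent = 1.0 + 0.5 * (x - a)
quadratic = quadratic_taylor(1.0, 0.5, -0.25, a, x)
exact = math.sqrt(x)
print(tangent - exact, quadratic - exact)  # errors of the two approximations
```

In this example the quadratic term removes most of the error left by the tangent approximation.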


Exercise 1 (2 minutes)
Use the quadratic Taylor approximation with a = 1 to evaluate exp (1.2)
approximately, given that exp (1) = e = 2.72 to 2 decimal places.

Compare the error in your result with the error of the tangent
approximation for the same number, which was given in the solution to
Exercise 14.1.1.2.


Exercise 2 (3 minutes)
Use the quadratic Taylor approximation with a = 0 to estimate
$\sin(\pi/10)$. In Exercise 14.1.1.1, the tangent approximation at
$\pi/6$ gave us a 3% error in $\sin(\pi/10)$. What is the error in this
new approximation? (The true value of $\sin(\pi/10)$ is 0.3090 to
4 decimal places.)


Solution 1
$\exp(1) = \exp'(1) = \exp''(1) = 2.72$
to 2 decimal places.
Therefore, using the quadratic Taylor approximation, we find:

$$\exp(1.2) \approx 2.72\left(1 + 0.2 + \tfrac{1}{2}(0.2)^2\right) = 2.72 \times 1.22 = 3.32 \text{ to 2 decimal places}$$

which agrees with the true value of exp (1.2) (given in the table in
Exercise 14.1.1.2) to 2 decimal places. Thus, the magnitude of the error
is less than or equal to 0.005 + 0.005 = 0.01, so this approximation is at
least 6 times as good as the tangent approximation to exp (1.2).


Solution 2
The quadratic Taylor approximation to the sine function about 0 gives

$$\sin\frac{\pi}{10} \approx 0 + 1 \times \frac{\pi}{10} + \frac{1}{2} \times 0 \times \left(\frac{\pi}{10}\right)^2 = \frac{\pi}{10} = 0.3142 \text{ to 4 decimal places.}$$

The quadratic Taylor approximation with a = 0 is in this case the same
as the tangent approximation with a = 0, and is about twice as accurate
as the tangent approximation with $a = \pi/6$ considered in
Exercise 14.1.1.1 (the error is about 1.5% instead of 3%).


An alternative derivation of the quadratic Taylor approximation is to
take the limit as h approaches 0 in the Gregory-Newton formula for the
quadratic that fits the given function f at the three tabular points a,
a + h and a + 2h. This formula was given in Unit 4, Finite Differences,
section 4.3.2; it is:

$$\text{quadratic} = f(a) + \theta\,\Delta_h f(a) + \frac{\theta(\theta - 1)}{2}\,\Delta_h^2 f(a)$$

where $\theta = \dfrac{x - a}{h}$, $\Delta_h f(x) = f(x + h) - f(x)$, and
$\Delta_h^2 = \Delta_h\Delta_h$.
Substituting for $\theta$ in the formula on the right-hand side, we have

$$\text{quadratic} = f(a) + \frac{\Delta_h f(a)}{h}(x - a) + \frac{\Delta_h^2 f(a)}{2h^2}(x - a)(x - a - h).$$


To obtain the Taylor approximation we take the limit as h approaches
zero. Then, by the definition of a derivative (see Unit 12,
Differentiation I, section 12.1.4) the coefficient of (x - a) becomes
$f'(a)$. The limit of the next coefficient, $\Delta_h^2 f(a)/h^2$, is
not so easy to evaluate but the following argument helps: the definition
of a derivative tells us that for small enough h we can approximately
replace the operator $\Delta_h/h$ by the differentiation operator D, so
it is plausible that we can also replace $(\Delta_h)^2/h^2$, that is,
$\dfrac{\Delta_h}{h} \circ \dfrac{\Delta_h}{h}$, by $D^2 = D \circ D$, in
which case we would get

$$\lim_{h \to 0} \frac{\Delta_h^2 f(a)}{h^2} = D^2 f(a).$$

So we find that as h approaches 0 the limit of the Gregory-Newton
formula gives

$$f(x) \approx f(a) + f'(a)(x - a) + \tfrac{1}{2}f''(a)(x - a)^2$$

which is the same as the quadratic Taylor approximation we arrived at
earlier.


(The reason why this argument is not a proof is that all our definitions
and theorems about approximations and limits refer to numbers. We have
no definitions or theorems relating to the approximation of operators
such as $\Delta_h$ and D, so that although we could write down
$\Delta_h/h \approx D$ or even $\lim_{h \to 0} \Delta_h/h = D$, these
formulas would have no precise meanings, and we would be unable to
deduce that $(\Delta_h/h)^2 \approx D^2$ or
$\lim_{h \to 0} (\Delta_h/h)^2 = D^2$.
In fact it is possible to prove that these results do hold, provided the
function f is, in a well-defined sense, well-behaved when x is close to a.)


Exercise 3 (3 minutes)
Evaluate

$$\lim_{h \to 0} \frac{\Delta_h^2 f(a)}{h^2}$$

where $f(x) = x^3$ $(x \in R)$, and a is any given real number; compare
the result with the value of $D^2 f(a)$ obtained from the rules of
differentiation.


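Solution 3

$$\frac{\Delta_h^2 f(a)}{h^2} = \frac{\Delta_h(\Delta_h f)(a)}{h^2} = \frac{\Delta_h f(a + h) - \Delta_h f(a)}{h^2} = \frac{f(a + 2h) - 2f(a + h) + f(a)}{h^2}.$$

With $f(x) = x^3$, this reduces to

$$\frac{\Delta_h^2 f(a)}{h^2} = \frac{6ah^2 + 6h^3}{h^2} = 6a + 6h.$$

Thus, we have

$$\lim_{h \to 0} \frac{\Delta_h^2 f(a)}{h^2} = 6a \quad \text{when } f(x) = x^3 \ (x \in R).$$

Differentiating f, we get $Df(x) = 3x^2$ and $D^2 f(x) = 6x$, and so
$D^2 f(a) = 6a$, which is the same as
$\lim_{h \to 0} \Delta_h^2 f(a)/h^2$ in this particular case.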
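The limit in this solution can also be watched numerically. The following sketch is an illustration only; the function names and the choice a = 2 are assumptions for the example.

```python
def second_difference_quotient(f, a, h):
    """Return (f(a + 2h) - 2 f(a + h) + f(a)) / h^2."""
    return (f(a + 2 * h) - 2 * f(a + h) + f(a)) / h ** 2

def cube(x):
    return x ** 3

a = 2.0
for h in (0.1, 0.01, 0.001):
    # For the cube function the quotient equals 6a + 6h, which tends to 6a.
    print(h, second_difference_quotient(cube, a, h))
```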


14.1.5 The General Taylor Approximation 


In the previous section we showed how the quadratic Taylor approximation
gives, in general, a better approximation than the linear one (the
tangent approximation); but for some purposes (such as the problem
considered on television, the calculation of $\sin(\pi/10)$ to 7 places
of decimals) even the quadratic approximation is not adequate. To look
for even better approximations, it is natural to try the same method
with a cubic polynomial, or one of even higher degree.


In this section we formulate the Taylor approximation that uses a
polynomial of any degree, say the nth. By analogy with the method that
worked for the quadratic polynomial, let us write the polynomial of
degree n in the form:

$$p(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + \cdots + b_n(x - a)^n$$

where $b_0, b_1, \ldots, b_n$ are numbers. (At this stage there is no
reason to assume any connection between the numbers $b_0$, $b_1$ and
$b_2$ in this section and the coefficients of the quadratic polynomial
in the preceding section, but we shall see presently that they are in
fact the same.) How do we determine the numbers $b_0, \ldots, b_n$?
Since there are n + 1 of them, we need n + 1 conditions to fix them all.
From the previous section we already have three conditions

$$p(a) = f(a)$$
$$p'(a) = f'(a)$$
$$p''(a) = f''(a)$$

where f is the function we are trying to approximate. It is natural to
impose the remaining conditions by continuing the list:

$$p'''(a) = f'''(a)$$
$$p^{(4)}(a) = f^{(4)}(a)$$
$$\vdots$$
$$p^{(n)}(a) = f^{(n)}(a)$$

where $f^{(n)}(a)$ means the nth derivative of f at a. The complete list
gives us exactly n + 1 conditions, and it is plausible to use these to
determine the numbers $b_0, \ldots, b_n$ in the definition of p. The next
two exercises deal with the determination of these numbers.


Exercise 1 (3 minutes)

If c is a polynomial function defined by

$$c(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + b_3(x - a)^3$$

and c(a) and the first three derivatives $c'(a)$, $c''(a)$ and $c'''(a)$
are equal to $f(a)$, $f'(a)$, $f''(a)$ and $f'''(a)$ respectively, find
$b_0$, $b_1$, $b_2$ and $b_3$, and hence write down a formula giving c in
terms of f and its first three derivatives at a.


Exercise 2 (3 minutes)

Guess the formula for the Taylor approximation by a polynomial of
degree n, where n is any positive integer. (The answer is given in the text
following Solution 1.)


Solution 1
We have

$$c(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + b_3(x - a)^3$$
$$c'(x) = b_1 + 2b_2(x - a) + 3b_3(x - a)^2$$
$$c''(x) = 2b_2 + 6b_3(x - a)$$
$$c'''(x) = 6b_3.$$

Therefore

$c(a) = b_0$ (and we are given that $c(a) = f(a)$)
$c'(a) = b_1$ (and we are given that $c'(a) = f'(a)$)
$c''(a) = 2b_2$ (and we are given that $c''(a) = f''(a)$)
$c'''(a) = 6b_3$ (and we are given that $c'''(a) = f'''(a)$).

Thus, $b_0 = f(a)$, $b_1 = f'(a)$, $b_2 = \tfrac{1}{2}f''(a)$,
$b_3 = \tfrac{1}{6}f'''(a)$, and so the formula for c is

$$c(x) = f(a) + f'(a)(x - a) + \tfrac{1}{2}f''(a)(x - a)^2 + \tfrac{1}{6}f'''(a)(x - a)^3.$$

The formula for the nth degree Taylor polynomial approximation can
be calculated by writing it in the form

$$p(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + \cdots + b_n(x - a)^n$$

and using the n + 1 conditions

$$p(a) = f(a),\ p'(a) = f'(a),\ \ldots,\ p^{(n)}(a) = f^{(n)}(a)$$

to determine the n + 1 numbers $b_0, b_1, \ldots, b_n$.
We thus obtain the Taylor approximation of degree n:

$$f(x) \approx f(a) + f'(a)(x - a) + \tfrac{1}{2}f''(a)(x - a)^2 + \cdots + \underbrace{\frac{1}{k!}f^{(k)}(a)(x - a)^k}_{\text{general term}} + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n$$

or, in summation notation,

$$f(x) \approx f(a) + \sum_{k=1}^{n} \frac{1}{k!}f^{(k)}(a)(x - a)^k.$$

This is usually referred to as Taylor’s approximation to f about a. The
factorials* in the denominator arise because the kth derived function of
$x \longmapsto (x - a)^k$ is $x \longmapsto k!$.

The value of a for which this approximation is simplest is usually 0,
and the resultant form of the Taylor approximation is common enough
to have a special name: it is called the Maclaurin approximation. Its
formula is

$$f(x) \approx f(0) + f'(0)x + \tfrac{1}{2}f''(0)x^2 + \cdots + \frac{1}{k!}f^{(k)}(0)x^k + \cdots + \frac{1}{n!}f^{(n)}(0)x^n.$$

The Maclaurin approximation is a Taylor approximation to f about 0.
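The summation form translates directly into a few lines of code. This is a sketch under stated assumptions: the function name `taylor_polynomial` is hypothetical, and the natural logarithm about a = 1 is chosen only as an example whose derivatives are easy to write down.

```python
import math

def taylor_polynomial(derivs_at_a, a, x):
    """Sum of f^(k)(a) (x - a)^k / k! for k = 0, ..., n.

    derivs_at_a[k] must hold the kth derivative of f at a (k = 0 is f(a)).
    """
    return sum(d * (x - a) ** k / math.factorial(k)
               for k, d in enumerate(derivs_at_a))

# Hypothetical example: f = ln about a = 1, where
# f(1) = 0, f'(1) = 1, f''(1) = -1, f'''(1) = 2, f''''(1) = -6.
approx = taylor_polynomial([0.0, 1.0, -1.0, 2.0, -6.0], 1.0, 1.1)
print(approx, math.log(1.1))
```

Setting a = 0 in the call gives the Maclaurin form.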


In the television programme we obtained the Maclaurin approximation
directly, instead of via the Taylor approximation, and we applied it to
the case where f is the sine function. For this function the derivatives
at 0 are:

$\sin 0 = 0$
$\sin' 0 = \cos 0 = 1$
$\sin'' 0 = -\sin 0 = 0$
$\sin''' 0 = -\cos 0 = -1$
$\sin^{(4)} 0 = \sin 0 = 0$

and the pattern 0, 1, 0, -1, 0, 1, 0, -1, ... goes on repeating itself;
substituting these values into the Maclaurin approximation we get the
successive approximations:

(n = 1 or 2)  $\sin x \approx x$

(n = 3 or 4)  $\sin x \approx x - \dfrac{x^3}{3!}$

(n = 5 or 6)  $\sin x \approx x - \dfrac{x^3}{3!} + \dfrac{x^5}{5!}$

(n = 7 or 8)  $\sin x \approx x - \dfrac{x^3}{3!} + \dfrac{x^5}{5!} - \dfrac{x^7}{7!}$

and so on.


* The symbol k! denotes the product 1 × 2 × 3 × ... × k, and is read “factorial k”.


In the television programme we show two tests of this approximation
method for sin x. One is to compare the graphs of the polynomial
approximations with the graph of the sine function. Here are some of
the results:

[Figure: graphs of the Maclaurin polynomial approximations compared with the graph of the sine function]

As we make n larger and larger the approximation gets better and better,
in the sense that the polynomial fits the sine curve over a wider and wider
interval, and there seems to be no restriction on the width of the interval
if we take a polynomial of sufficiently high degree. But note that the
diagrams are somewhat deceptive, since the lines must have some
thickness in order to be visible, and all they really show is that the
approximating curve gradually “follows” the sine curve as the degree of
the approximating polynomial increases.


The second (and better) test used in the television programme is to
calculate the first few approximations to $\sin(\pi/10)$. These are

n = 1:  $\sin\dfrac{\pi}{10} \approx \dfrac{\pi}{10} = 0.3141593$

n = 3:  $\sin\dfrac{\pi}{10} \approx \dfrac{\pi}{10} - \dfrac{1}{3!}\left(\dfrac{\pi}{10}\right)^3 = 0.3089916$

n = 5:  $\sin\dfrac{\pi}{10} \approx 0.3090171$

n = 7:  $\sin\dfrac{\pi}{10} \approx 0.3090170$.

If we continued with higher values of n the result would still be 0.3090170
to 7 decimal places. This number agrees with the measured value of
0.3090 ± 0.0001, to within the estimated error. This is a good
demonstration (though not, of course, a proof) of the validity of
Maclaurin’s approximation for the sine function.
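The successive approximations can be reproduced with a short sketch; the function name `maclaurin_sin` is an assumption made for this illustration.

```python
import math

def maclaurin_sin(x, n):
    """Maclaurin polynomial of degree n for the sine function, evaluated at x."""
    total = 0.0
    for k in range(1, n + 1, 2):  # only odd powers of x appear
        sign = (-1) ** ((k - 1) // 2)
        total += sign * x ** k / math.factorial(k)
    return total

x = math.pi / 10
for n in (1, 3, 5, 7):
    print(n, f"{maclaurin_sin(x, n):.7f}")
```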


It is a remarkable fact that, knowing the images of the sine function and 
its derived functions at the single element 0, Maclaurin’s formula gives 
us a method of investigating the images of all the real numbers under the 
sine function. 


Exercise 3 (3 minutes)
Find the general Maclaurin approximation to the exponential function, and
calculate the first few Maclaurin approximations for exp (0.1) to 3
decimal places. Compare your results with the calculation of exp (0.1)
directly from the definition, given in section 7.4.1 of Unit 7, Sequences
and Limits I; the first 10 steps of that calculation are given in the table.

     k     (1 + 0.1/k)^k
     1     1.1
     2     1.1025
     3     1.1034
     4     1.1038
     5     1.1041
     6     1.1043
     7     1.1044
     8     1.1045
     9     1.1046
    10     1.1046

Exercise 4 (2 minutes)

Find the general Maclaurin approximation to the cosine function, and
calculate the first three distinct Maclaurin approximations for cos (0.3)
to 3 decimal places. Compare your results with the true value, 0.9553.


In the examples considered so far, Maclaurin’s approximation has been
extremely successful; the following exercise shows that this is not always
the case.

Exercise 5 (5 minutes)
Find the general Maclaurin approximation to the function

$$x \longmapsto (1 - x)^s \quad (x \in R,\ x \neq 1),$$

where s is any real number.
Do you recognize the approximation when s is a positive integer?

Calculate the first few Maclaurin approximations to $(1 - x)^{-1}$, where
(i) x = 0.1    (ii) x = 10.


Solution 3
Since the nth derivative of exp at x is exp (x) for all n, the general Maclaurin
approximation of degree n for exp (x) is simply

$$\exp(x) \approx \exp(0) + x\exp(0) + \tfrac{1}{2}x^2\exp(0) + \cdots + \frac{1}{n!}x^n\exp(0) = 1 + x + \tfrac{1}{2}x^2 + \cdots + \frac{1}{n!}x^n.$$

The first-degree approximation to exp (0.1) is
$\exp(0.1) \approx 1 + 0.1 = 1.1$.
The second-degree approximation is
$\exp(0.1) \approx 1 + 0.1 + 0.005 = 1.105$.
The third-degree approximation is again
$\exp(0.1) \approx 1.105$ to 3 decimal places,

and the fourth and higher degree approximations also give 1.105. Thus
the Maclaurin approximations for exp (0.1) converge much more quickly
than the calculation directly from the definition: the second-degree
approximation is already correct to 3 decimal places.


Solution 4
$D\cos(x) = -\sin x$ gives $D\cos(0) = 0$;
$D^2\cos(x) = -\cos x$ gives $D^2\cos(0) = -1$;
$D^3\cos(x) = \sin x$ gives $D^3\cos(0) = 0$;
$D^4\cos(x) = \cos x$ gives $D^4\cos(0) = 1$;

and so on.

Therefore, the Maclaurin approximation to the cosine function contains
only even powers of x; so for any positive integer n the Maclaurin
polynomial approximations of degrees 2n and 2n + 1 are the same, and are
given by

$$\cos x \approx 1 - \frac{1}{2!}x^2 + \frac{1}{4!}x^4 - \cdots + (-1)^n\frac{1}{(2n)!}x^{2n}.$$

The approximation of degree 0 (or 1) to cos (0.3) is therefore
$\cos(0.3) \approx 1$.

The approximation of degree 2 (or 3) is
$\cos(0.3) \approx 1 - (\tfrac{1}{2} \times 0.09) = 0.955$.

The approximation of degree 4 (or 5) is
$\cos(0.3) \approx 1 - (\tfrac{1}{2} \times 0.09) + (\tfrac{1}{24} \times 0.0081) = 0.9553$

which agrees with the true value to 4 decimal places.


Solution 5
Let

$$f : x \longmapsto (1 - x)^s \quad (x \in R,\ x \neq 1);$$

then

$$Df : x \longmapsto -s(1 - x)^{s-1} \quad (x \in R,\ x \neq 1),$$
$$D^2 f : x \longmapsto s(s - 1)(1 - x)^{s-2} \quad (x \in R,\ x \neq 1),$$

and, for any $n \in Z^+$,

$$D^n f : x \longmapsto (-1)^n s(s - 1)\cdots(s - n + 1)(1 - x)^{s-n} \quad (x \in R,\ x \neq 1).$$

The general Maclaurin approximation of degree n is

$$(1 - x)^s \approx 1 - sx + \frac{s(s - 1)}{2!}x^2 - \frac{s(s - 1)(s - 2)}{3!}x^3 + \cdots + (-1)^n\frac{s(s - 1)\cdots(s - n + 1)}{n!}x^n.$$

When s is a positive integer, the sth coefficient is $(-1)^s$, and each
coefficient thereafter has a zero in the numerator, and is therefore
equal to zero. You should recognize the above polynomial as the binomial
expansion (see RB9) for $(1 - x)^s$, which gives the exact value of
$(1 - x)^s$ when s is a positive integer.

When s = -1, the situation is vastly different. The general Maclaurin
approximation of degree n is now

$$(1 - x)^{-1} \approx 1 - (-1)x + \frac{(-1) \times (-2)}{2!}x^2 - \frac{(-1) \times (-2) \times (-3)}{3!}x^3 + \cdots + (-1)^n\frac{(-1) \times (-2) \times \cdots \times (-n)}{n!}x^n$$
$$= 1 + x + x^2 + \cdots + x^n.$$

In this case, since s is not a positive integer, there is no exact polynomial
expression for $(1 - x)^s$, and so there is no “final” polynomial in the
sequence of Maclaurin approximations.

(i) When x = 0.1,

$$(1 - x)^{-1} = \frac{10}{9} = 1.111\ldots$$

The first approximation is 1.1;
the second approximation is 1.11;
the third approximation is 1.111;
etc.

(ii) When x = 10,

$$(1 - x)^{-1} = -\frac{1}{9} = -0.111\ldots$$

The first “approximation” is 11;
the second “approximation” is 111;
the third “approximation” is 1111;
etc.
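The contrast between the two cases is easy to reproduce. In this sketch the function name `geometric_partial_sum` is an assumption made for illustration; it simply evaluates 1 + x + x^2 + ... + x^n.

```python
def geometric_partial_sum(x, n):
    """Maclaurin approximation of degree n to 1 / (1 - x): 1 + x + ... + x^n."""
    return sum(x ** k for k in range(n + 1))

for x in (0.1, 10.0):
    exact = 1 / (1 - x)
    sums = [geometric_partial_sum(x, n) for n in (1, 2, 3)]
    print(x, exact, sums)
```

For x = 0.1 the sums settle towards 10/9, while for x = 10 they grow without bound and never approach -1/9.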


In the last exercise the method was successful for x = 0.1, but for x = 10
the “approximations” bear no relation whatever to the correct value.
Essentially the same thing can be shown by looking at the graphs of
the successive Maclaurin approximations to $(1 - x)^{-1}$; these are shown
in the following diagram* and its overlays:

[Diagram and overlays: graphs of the successive Maclaurin approximations to $(1 - x)^{-1}$]

The graphs show that the nature of the approximation is not the same
as in the case of sin x: for sin x the interval over which the approximation
is good gets wider as the degree of the polynomial gets higher; but for
$(1 - x)^{-1}$ the interval of good approximation is always contained within
the interval [-1, 1].


These results show that the Taylor (Maclaurin) approximation method 
is quite temperamental: sometimes it is very effective, but on other 
occasions the approximations it produces are wide of the mark. The 
method is a very powerful one, but to be able to use it without getting 
into trouble one needs either very sound intuition or some theorems 
that will specify the situations in which the method is successful. In the 
next section of the text we shall leave the exploratory approach we have 
been using and look at the theory of the Taylor approximation method 
from a rigorous point of view. 


* We have drawn only the part of the graph for which 1/(1 — x) > 0; the dotted line is the 
line specified by x = 1. 
The overlays are in the wallet on the inside of the back cover of this text. 




14.2 INFINITE SERIES
14.2.1 Taylor’s Theorem
The main purpose of this section is to enable you to recognize the
situations where the Taylor (or Maclaurin) approximation method works
satisfactorily, so that you will know how to take advantage of the method
without getting false results. To do this we shall follow the philosophy
of Unit 2, Errors and Accuracy, and look for a bound on the error of the
approximation.


To explain the principle of the method, we consider first how to estimate 
the error in the simplest of the Taylor polynomial approximations, the 
tangent approximation. The absolute error in any approximation is 
defined to be 


(absolute error) = (approximation) — (exact value) 


(see Unit 2, Errors and Accuracy, section 2.1.1). 


It is a little more convenient to work not with the error itself but with its 
negative, which is the correction that must be added to the approxima- 
tion to cancel the error and thus yield the exact value: 


(correction) = (exact value) — (approximation). 


The error and the correction have the same magnitude (modulus), so
that any bound on the magnitude of the correction is automatically an
error bound too. For any given function f, let us denote the correction
to the tangent approximation for f(x) about some given point a by
$C_1(x)$, the subscript 1 indicating that this refers to the Taylor
approximation of degree one. The formula for the tangent approximation
(which we found in section 14.1.1) is

$$f(x) \approx f(a) + f'(a)(x - a),$$

and hence the correction is given by

$$C_1(x) = f(x) - \bigl(f(a) + f'(a)(x - a)\bigr). \qquad \text{Equation (1)}$$

Now $C_1(x)$ is the number we wish to estimate, but let us first get some
idea of its size by trying some suitable approximations.


[Figure: the tangent approximation to f about a, and the correction $C_1(x)$]

One way of getting an idea of the size of $C_1(x)$ is to replace f(x) on
the right of Equation (1) by a convenient approximation. What
approximation would you suggest? The tangent approximation about a will
not do, for that would give

$$C_1(x) \approx (\text{tangent approx.}) - (\text{tangent approx.}) = 0$$

which is no help. But, by using the next Taylor polynomial for f(x), we
can get a useful estimate; it is

$$C_1(x) \approx \bigl(f(a) + f'(a)(x - a) + \tfrac{1}{2}f''(a)(x - a)^2\bigr) - \bigl(f(a) + f'(a)(x - a)\bigr) = \tfrac{1}{2}f''(a)(x - a)^2.$$

Thus, $C_1(x)$ is roughly proportional to the square of the distance
(x - a), and also to the second derivative of f at a. Both these facts
can also be seen from the above figure, especially if it is redrawn to
show how $C_1(x)$ depends on x.


Exercise 1 (3 minutes)

Use the formula

$$C_1(x) \approx \tfrac{1}{2}f''(a)(x - a)^2$$

to estimate (to 2 decimal places) the error in the tangent approximation
at 1 to the exponential function for x = 0.8, 0.9, 1.1, 1.2, and compare
with your calculated results for this error from Exercise 14.1.1.2.

The problem now is to convert the rough estimate for the correction to
the tangent approximation about a,

$$C_1(x) \approx \tfrac{1}{2}f''(a)(x - a)^2,$$

into a precise specification of the accuracy of this approximation. The
above result suggests that it may be possible to specify the accuracy of
the tangent approximation by a formula such as

$$|C_1(x)| \le \tfrac{1}{2}B(x - a)^2 \qquad \text{Inequality (1)}$$

in which B is somehow related to the second derived function of f.


In fact, this method of specifying the accuracy does prove satisfactory.
It can be shown that the result holds provided B is an upper bound on
the magnitude of the second derivative of f over the interval [a, x] (or
[x, a] if x < a); that is, provided

$$|f''(t)| \le B \quad (t \in [a, x]). \qquad \text{Inequality (2)}$$

Inequalities (1) and (2) together constitute a statement of Taylor’s Theorem
for the tangent approximation. A proof is given in the Appendix.
Example 1

As an illustration, let us apply Taylor’s Theorem to the case already
considered in Exercise 14.1.1.2 and Exercise 1, where f is the exponential
function and a = 1. In this case the tangent approximation is

$$\exp x \approx \exp(1) + (x - 1) \times \exp'(1) = 2.7183 + (x - 1) \times 2.7183,$$

and Taylor’s Theorem tells us that the correction satisfies the inequality

$$|C_1(x)| \le \tfrac{1}{2}B(x - 1)^2,$$

provided B satisfies the inequality

$$|\exp t| \le B \quad (t \in [1, x])$$

(since $\exp'' = \exp$).

[Figure: graph of $t \longmapsto \exp(t)$]

Since exp t increases as t increases, its largest value for $t \in [1, x]$
is achieved when t is the largest number in the interval [1, x], which is
x if x ≥ 1 and 1 if x < 1. Accordingly we can satisfy the last inequality
by taking B to be the image under the exponential function of the largest
number in the interval:

$$B = \begin{cases} \exp x & \text{if } x \ge 1 \\ e & \text{if } x < 1. \end{cases}$$

(We could take B larger than this if we wished, and still satisfy the required
inequality, but this would weaken the bound given by the first inequality
without gaining anything.) Thus, Taylor’s Theorem tells us that

$$\exp x \approx 2.7183 + (x - 1) \times 2.7183$$

with a correction of magnitude not exceeding

$$\begin{cases} \tfrac{1}{2}(\exp x) \times (x - 1)^2 & \text{if } x \ge 1 \\ \tfrac{1}{2}e \times (x - 1)^2 & \text{if } x < 1. \end{cases}$$

For example, if x = 0.8, Taylor’s Theorem tells us that the magnitude of
the correction cannot exceed

$$\tfrac{1}{2} \times 2.7183 \times (-0.2)^2 = 0.0544.$$

The actual correction is

$$\exp(0.8) - (2.7183 + (-0.2) \times 2.7183) = 2.2255 - 2.1746 = 0.0509.$$

If x = 1.2, Taylor’s Theorem tells us that the magnitude of the correction
cannot exceed

$$\tfrac{1}{2} \times 3.3201 \times (0.2)^2 = 0.0664.$$

(continued on page 30)




Solution 1
$\exp(1) = \exp'(1) = \exp''(1) = 2.72$ (to 2 decimal places).

Therefore we have the following estimates (to 2 decimal places) for the
error in the tangent approximation at 1 for x = 0.8, 0.9, 1.1, 1.2:

$C_1(0.8) \approx \tfrac{1}{2} \times 2.72 \times (-0.2)^2 = 0.05$
$C_1(0.9) \approx \tfrac{1}{2} \times 2.72 \times (-0.1)^2 = 0.01$
$C_1(1.1) \approx 0.01$
$C_1(1.2) \approx 0.05$

which agrees with Exercise 14.1.1.2, except that there we obtained
$C_1(1.2) = 0.06$ instead of 0.05.


(continued from page 29) 


The actual correction is 
exp (1.2) — (2.7183 + (0.2) x 2.7183) 
= 3.3201 — 3.2620 
= 0.0581. 


Thus in both cases the theorem is verified.
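The two checks in this example can be repeated for other values of x with a short sketch. The function name `tangent_correction_and_bound` is an assumption made for illustration, and the code uses full floating-point values of exp rather than the 4-decimal figures of the text, so the last digit may differ slightly.

```python
import math

def tangent_correction_and_bound(x, a=1.0):
    """Correction to the tangent approximation of exp about a, and its Taylor bound."""
    tangent = math.exp(a) + (x - a) * math.exp(a)
    correction = math.exp(x) - tangent
    B = math.exp(max(a, x))  # largest value of exp on the interval between a and x
    bound = 0.5 * B * (x - a) ** 2
    return correction, bound

for x in (0.8, 1.2):
    correction, bound = tangent_correction_and_bound(x)
    print(x, round(correction, 4), round(bound, 4))
```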


Exercise 2 (2 minutes)

Use Taylor’s Theorem with a = 0 to obtain an absolute error bound for
the approximation

$$\exp x \approx 1 + x$$

for x ≤ 0.
Deduce that

$$\exp(-0.2) \in [0.78, 0.82].$$

Exercise 3 (2 minutes)

Use Taylor’s Theorem to obtain an absolute error bound for the tangent
approximation about 0 to $\sin(\pi/10)$,

$$\sin\frac{\pi}{10} \approx \frac{\pi}{10},$$

and compare it with the actual error. (The correct value of
$\sin(\pi/10)$ is 0.3090 to 4 decimal places.)


Exercise 4* (2 minutes)

If p denotes the proposition that Inequality (1) holds, and q denotes the
proposition that Inequality (2) holds, which of the following propositions
is logically equivalent to Taylor’s Theorem?

(i) $p \wedge q$    (ii) $p \vee q$    (iii) $p \Rightarrow q$
(iv) $q \Rightarrow p$    (v) $p \Leftrightarrow q$


* The symbols in this exercise were introduced in Unit 11, Logic I.


14.2.2 The General Taylor Theorem


In this section we show how the method discussed in the preceding
section can be generalized to give an upper bound on the correction to
a Taylor approximation polynomial of general degree. This generalization
makes it possible not only to estimate the error in any individual
Taylor approximation, but also to see whether the sequence formed by
the successive polynomial approximations of increasing degree to a
given function value is convergent and has that function value as its
limit; it also gives a new and powerful method of defining or specifying
functions.


The Taylor approximation of degree n, obtained in section 14.1.5, is:

$$f(x) \approx f(a) + f'(a)(x - a) + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n.$$

The correction associated with this approximation is therefore

$$C_n(x) = f(x) - \Bigl[f(a) + f'(a)(x - a) + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n\Bigr].$$

Just as in the case of the tangent approximation, we can get a rough
approximation to $C_n(x)$ by using the next approximation for f(x). We
obtain this by replacing n by n + 1 in the above Taylor approximation,
and we find that

$$C_n(x) \approx \Bigl[f(a) + f'(a)(x - a) + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n + \frac{1}{(n + 1)!}f^{(n+1)}(a)(x - a)^{n+1}\Bigr] - \Bigl[f(a) + f'(a)(x - a) + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n\Bigr].$$

Thus

$$C_n(x) \approx \frac{1}{(n + 1)!}f^{(n+1)}(a)(x - a)^{n+1}.$$

This suggests that there may be a useful formula for the accuracy of the
nth degree Taylor approximation of the form:

$$|C_n(x)| \le \frac{1}{(n + 1)!}B_{n+1}|x - a|^{n+1} \qquad \text{Inequality (1)}$$

where $B_{n+1}$ depends on $f^{(n+1)}$. As in the case of the tangent
approximation (n = 1), it is possible to show that a sufficient condition
for the above inequality to hold is

$$|f^{(n+1)}(t)| \le B_{n+1} \quad (t \in [a, x]), \qquad \text{Inequality (2)}$$

where $f^{(n+1)}$ is continuous throughout the interval [a, x], and where
[a, x] is to be interpreted as [x, a] if x < a.


The statement that Inequality (2) implies Inequality (1) is the general form
of Taylor’s Theorem. The proof of this is given in the Appendix.


As an illustration of the use of Taylor’s Theorem, let us apply it to the
Maclaurin approximation,

$$\sin x \approx x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!},$$

which is used in the television programme to calculate a number which
appears to be $\sin(\pi/10)$ correct to 7 decimal places. We are now in a

(continued on page 33)


Solution 14.2.1.2

Taylor’s Theorem tells us that the magnitude of the error, which equals
the magnitude of the correction, cannot exceed

$$\tfrac{1}{2}Bx^2,$$

where $|\exp t| \le B$ $(t \in [x, 0])$.
Since exp t increases with t, and x is negative, the largest value of exp t
for $t \in [x, 0]$ is exp 0 = 1; so we may take B = 1. The magnitude of the
error in the approximation,

$$\exp x \approx 1 + x,$$

is therefore at most $\tfrac{1}{2}x^2$. In the particular case when
x = -0.2 we have

$$\exp(-0.2) \approx 1 - 0.2 = 0.8,$$

with an absolute error bound of $\tfrac{1}{2}(-0.2)^2 = 0.02$, from which
it follows that

$$\exp(-0.2) \in [0.8 - 0.02,\ 0.8 + 0.02],$$

that is, $\exp(-0.2) \in [0.78, 0.82]$.


Solution 14.2.1.3
The tangent approximation we are using is

$$\sin x \approx \sin 0 + x\sin' 0 = x.$$

Taylor’s Theorem tells us that the magnitude of the error cannot exceed

$$\tfrac{1}{2}Bx^2,$$

where $|\sin'' t| \le B$ $(t \in [0, x])$.

Since $\sin' = \cos$ and $\cos' = -\sin$, we have $\sin'' = -\sin$, and so
the condition on B reduces to

$$|\sin t| \le B \quad (t \in [0, x]).$$

For the case when $x = \pi/10$, we require a number B such that

$$|\sin t| \le B \quad \left(t \in \left[0, \frac{\pi}{10}\right]\right).$$

For a quick estimate of the absolute error bound we may use the fact that
sin t always lies in the range [-1, 1], and take B = 1. We thus obtain
the following value for the absolute error bound:

$$\frac{1}{2} \times 1 \times \left(\frac{\pi}{10}\right)^2 \approx 0.05,$$

which is a perfectly satisfactory answer to the question.

Alternatively, we can do a little more work and get the “best” (that is,
the smallest possible) value of B.

[Figure: graph of sin t]

Since sin t increases with t in the interval $[0, \pi/2]$, its largest
value in $[0, \pi/10]$ is $\sin(\pi/10) = 0.3090$. This gives the absolute
error bound:

$$\frac{1}{2} \times 0.3090 \times \left(\frac{\pi}{10}\right)^2 \approx 0.3090 \times 0.05 = 0.015.$$

(This is, of course, only of theoretical interest. Here we are discussing
the accuracy of Taylor’s approximation, but in a practical case we might
want to calculate $\sin(\pi/10)$ using Taylor’s approximation, and then we
could not use our calculated value of $\sin(\pi/10)$ to obtain an error
bound!)

The magnitude of the actual error is

$$\left|\frac{\pi}{10} - \sin\frac{\pi}{10}\right| = |0.3142 - 0.3090| = 0.0052.$$

Taylor’s Theorem over-estimates the error by a factor of about 3 when
the “best” value of B is used.


Solution 14.2.1.4

The statement of Taylor’s Theorem given in the text is that Inequality (1)
holds provided Inequality (2) holds; that is, if (2) holds, then (1) holds.
In the notation used in the question, this statement is “if q, then p”, and
so the corresponding proposition is

(iv) $q \Rightarrow p$.


(continued from page 31)

position to prove that this number really is $\sin(\pi/10)$ correct to
7 decimal places. The correction we are interested in here is

$$C_7(x) = \sin x - \left(x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!}\right)$$

and according to Taylor’s Theorem its magnitude has the upper bound

$$|C_7(x)| \le \frac{1}{8!}B_8|x|^8$$

where $B_8$ is such that

$$|\sin^{(8)} t| \le B_8 \quad (t \in [0, x]),$$

or, since the 8th derived function of sin is sin itself,

$$|\sin t| \le B_8 \quad (t \in [0, x]).$$

As in the solution to Exercise 14.2.1.3, it is not essential to find the
“best” (that is, the smallest possible) value of $B_8$; any value
satisfying the last inequality will do. Since we know that
$|\sin t| \le 1$ for all $t \in R$, it is convenient to take $B_8 = 1$.
(We can always come back later and look for the best possible value of
$B_8$ if the bound we get using $B_8 = 1$ turns out to be too weak.)

Substituting $B_8 = 1$ into the inequality for $|C_7(x)|$ gives:

$$|C_7(x)| \le \frac{1}{8!}|x|^8 = \frac{|x|^8}{40\,320}.$$


Thus, if the 7th degree Taylor polynomial is used to calculate an
approximate value for $\sin(\pi/10)$, the magnitude of the error is at most

$$\frac{1}{40\,320}\left(\frac{\pi}{10}\right)^8 < \frac{10^{-4}}{40\,000} \quad (\text{since } \pi^2 \approx 9.9 < 10)$$
$$= \tfrac{1}{4} \times 10^{-8}.$$

Therefore, working to 8 decimal places, the error in the Taylor
approximation is less than the possible error of
$\tfrac{1}{2} \times 10^{-8}$ introduced by each arithmetical operation,
and can be safely ignored when we have rounded off the final answer to
7 decimal places.


The astonishing thing about this calculation is not the 7-figure accuracy,
but the fact that we can rigorously estimate the error in our calculation of
$\sin(\pi/10)$ without knowing in advance anything about the value of
$\sin(\pi/10)$. We only used the first 7 derivatives of the sine function
at 0 and the fact that the magnitude of the 8th derivative can never
exceed 1.
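The bound can be set beside the actual error in a short sketch. Here the library sine is used only as an independent check, which of course the argument above does not need; the variable names are illustrative.

```python
import math

x = math.pi / 10
# Degree-7 Maclaurin polynomial for the sine function
p7 = (x - x ** 3 / math.factorial(3)
        + x ** 5 / math.factorial(5)
        - x ** 7 / math.factorial(7))
bound = x ** 8 / math.factorial(8)  # Taylor bound with B_8 = 1
actual = abs(math.sin(x) - p7)      # independent check against the library value
print(f"{p7:.7f}", bound, actual)
```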


Exercise 1 (2 minutes)

In Exercise 14.1.5.3 we evaluated exp (0.1) using various Taylor
polynomials about 0. In particular, the second-degree polynomial gave

$$\exp(0.1) \approx 1 + 0.1 + \tfrac{1}{2} \times (0.1)^2 = 1.105.$$

Use Taylor’s Theorem to find an absolute error bound for this
approximation. You may assume that exp (0.1) < 2.


14.2.3 Convergence of an Approximation Sequence 
The fact that we were able to obtain such an accurate approximation to
$\sin(\pi/10)$ using Maclaurin polynomials suggests that we may be able to
get any accuracy we please if we use polynomials of sufficiently high
degree.

Is it really possible to get any accuracy we please? To answer this question,
we examine the absolute error, $|C_n(\pi/10)|$; the question is whether
we can make $|C_n(\pi/10)|$ as small as we please by making n large enough;
that is, whether

$$\lim_{n \text{ large}} C_n\left(\frac{\pi}{10}\right) = 0,$$

where

$$C_n(x) = \sin x - \left(x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots \pm \frac{x^n}{n!}\right).$$

(The last term in the polynomial has magnitude $|x|^n/n!$ if n is odd, but
$|x|^{n-1}/(n-1)!$ if n is even.)


Taylor's Theorem tells us that

$$\left|C_n\left(\frac{\pi}{10}\right)\right| \le \frac{1}{(n+1)!}\,B\left(\frac{\pi}{10}\right)^{n+1}$$

provided B satisfies

$$|\sin^{(n+1)} t| \le B \quad (t \in [0, \pi/10]).$$

Whatever positive integer, n, we choose, $\sin^{(n+1)} t$ is one of sin t, cos t, −sin t
and −cos t, and so $|\sin^{(n+1)} t| \le 1$ for all $t \in R$; so we can safely choose
B = 1. Then we have

$$\left|C_n\left(\frac{\pi}{10}\right)\right| \le \frac{1}{(n+1)!}\left(\frac{\pi}{10}\right)^{n+1} < \left(\frac{1}{2}\right)^{n+1}.$$

Consequently, we see that we can ensure that $|C_n(\pi/10)|$ is as small as we
please by making n large enough. For example, to ensure that
$|C_n(\pi/10)| < 2^{-1000}$ it is sufficient to take n = 999. In other words, we
have shown that

$$\lim_{n \text{ large}} C_n\left(\frac{\pi}{10}\right) = 0.$$

Therefore, whatever accuracy is chosen, it is possible to specify n such
that the Maclaurin polynomial of degree n gives $\sin(\pi/10)$ to the required
accuracy.
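
To make the phrase "it is possible to specify n" concrete, here is a minimal Python sketch (our own illustration, with the hypothetical function name `degree_needed`). It assumes, as in the argument above, that every derived function of the function being approximated is bounded in magnitude by 1, so that the bound on the correction is $|x|^{n+1}/(n+1)!$.

```python
import math

def degree_needed(x, tolerance):
    """Smallest n for which |x|**(n+1) / (n+1)! <= tolerance (taking B = 1)."""
    n = 0
    bound = abs(x)                 # the bound for n = 0 is |x|**1 / 1!
    while bound > tolerance:
        n += 1
        bound *= abs(x) / (n + 1)  # turn |x|**n / n! into |x|**(n+1) / (n+1)!
    return n

x = math.pi / 10
for tolerance in (1e-3, 1e-8, 1e-15):
    print(tolerance, degree_needed(x, tolerance))
```

The bound is updated from one value of n to the next rather than recomputed from scratch, which avoids forming very large factorials.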


A similar analysis can be carried out for the Maclaurin approximation 
to sin x for any x, and gives the same result: for any real number x and 
any stated accuracy, an integer n can be specified such that the Maclaurin 
polynomial of degree n gives sin x to the required accuracy. This fact
(which we shall not prove here, though the proof is not difficult) is 
illustrated in the film in the television programme, which shows how the 
interval over which the approximating curve follows the sine curve 
expands as we include more and more terms in the Maclaurin polynomial. 
Similar results apply for other functions, for example, the cosine and the 
exponential functions. There are, however, functions for which the method 
only works for an interval of finite width in the domain of the function 
and others for which it does not work at all. We look at some of these 
in the final section of this text. 


Exercise 1 (5 minutes)

Write down the Maclaurin polynomial approximations of degrees 1, 2, 3 and 4
for cos x. Use Taylor's Theorem to find an integer n such that the Maclaurin
polynomial of degree n for cos x gives an approximation for cos(2) that is
accurate to 2 decimal places, and write down the Maclaurin polynomial of
this degree. (If you find this part very difficult, Solution 14.1.5.4 may be
of some help.) The table of factorials may also be useful:

     n          n!
     1           1
     2           2
     3           6
     4          24
     5         120
     6         720
     7       5 040
     8      40 320
     9     362 880
    10   3 628 800



Solution 14.2.2.1

The correction to the Taylor polynomial approximation we are
considering is

$$C_2(x) = \exp x - (1 + x + \tfrac{1}{2}x^2),$$

and Taylor's Theorem tells us that

$$|C_2(x)| \le \frac{1}{3!} B_3 |x|^3$$

where

$$|\exp''' t| \le B_3 \quad (t \in [0, x]).$$

Since all the derived functions of exp are also exp, and the value of x
we are interested in is 0.1, the condition on $B_3$ can be written:

$$|\exp t| \le B_3 \quad (t \in [0, 0.1]).$$

Since exp t increases with t, its maximum value for t in [0, 0.1] is exp (0.1),
which we know to be less than 2. So we may take $B_3 = 2$; then Taylor's
Theorem gives

$$|C_2(0.1)| \le \tfrac{1}{6} \times 2 \times (0.1)^3 = \tfrac{1}{3} \times 10^{-3},$$

and so the absolute error bound we are seeking is $\tfrac{1}{3} \times 10^{-3}$.
This shows, incidentally, that

exp (0.1) = 1.105 to 3 decimal places,

since 1.105 is the exact value of the quadratic Taylor approximation.
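
As a quick numerical check of this solution (a sketch of our own, not part of the original text), the bound with $B_3 = 2$ can be compared with the error actually made, using the library value of exp only for the comparison.

```python
import math

x = 0.1
approx = 1 + x + x**2 / 2                  # quadratic Taylor approximation
bound = (1 / math.factorial(3)) * 2 * x**3 # Taylor's Theorem with B_3 = 2
actual = abs(math.exp(x) - approx)

print("approximation:", approx)
print("error bound:  ", bound)
print("actual error: ", actual)
```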


Solution 1

The required Maclaurin polynomials are:

degree 1:      $\cos x \approx 1$

degrees 2, 3:  $\cos x \approx 1 - \tfrac{1}{2}x^2$

degree 4:      $\cos x \approx 1 - \tfrac{1}{2}x^2 + \tfrac{1}{24}x^4$

Taylor's Theorem tells us that

$$|C_n(x)| \le \frac{1}{(n+1)!} B_{n+1} |x|^{n+1}$$

provided $|\cos^{(n+1)} t| \le B_{n+1}$ $(t \in [0, x])$.

Since all the derived functions of cos are ±cos or ±sin, $B_{n+1}$ can be
taken as 1 for all n: so, with x = 2, our problem is to find an n such that

$$\frac{1}{(n+1)!}\,2^{n+1} \le \tfrac{1}{2} \times 10^{-2},$$

to ensure that $|C_n(x)| \le \tfrac{1}{2} \times 10^{-2}$.

Trying successive values of n, and using the table of factorials, we obtain:

$$\frac{2^7}{7!} = \frac{128}{5\,040} > \frac{1}{200},$$
$$\frac{2^8}{8!} = \frac{256}{40\,320} > \frac{1}{200},$$
$$\frac{2^9}{9!} = \frac{512}{362\,880} < \frac{600}{360\,000} = \frac{1}{600} < \frac{1}{200}.$$



Thus the conditions of the problem are satisfied with n = 8. You may 
have chosen a value of n larger than 8. This is also correct: it gives an 
even smaller absolute error bound. 


The Maclaurin polynomial approximation of degree 8 for cos x is

$$\cos x \approx 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \frac{x^8}{8!}.$$
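
The choice n = 8 can be checked by evaluating the polynomial at x = 2. The Python sketch below is our own illustration (the name `cos_maclaurin` is hypothetical); it compares the error actually made with the bound $2^9/9!$ given by Taylor's Theorem with $B_9 = 1$.

```python
import math

def cos_maclaurin(x, degree):
    """Maclaurin polynomial for cos of the given degree, evaluated at x."""
    total = 0.0
    for m in range(0, degree + 1, 2):            # even powers only
        sign = -1.0 if (m // 2) % 2 else 1.0     # +, -, +, - for m = 0, 2, 4, 6
        total += sign * x**m / math.factorial(m)
    return total

x = 2.0
approx = cos_maclaurin(x, 8)
bound = x**9 / math.factorial(9)     # Taylor's Theorem with B_9 = 1
error = abs(math.cos(x) - approx)

print("approximation:", approx)
print("error bound:  ", bound)
print("actual error: ", error)
```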


14.2.4 Infinite Series 


So far we have shown how to obtain various polynomial approximations 
to the image of a given function, sin say, for a given element, x say, in 
its domain: 


$$\sin x \approx x$$
$$\sin x \approx x - \frac{x^3}{3!}$$
$$\sin x \approx x - \frac{x^3}{3!} + \frac{x^5}{5!}$$

etc.

We have shown that, in favourable cases (of which this example is one),
the sequence of successive approximations thus obtained converges and
has the exact image value as its limit. This sequence of successive
approximations differs a little from the ones we considered in Unit 7,
Sequences and Limits I, in that each new element of the sequence is
calculated not from a recurrence formula, but by adding a term, such as
$-x^3/3!$ or $x^5/5!$, to the preceding approximation. The successive terms
that we may add also form an infinite sequence:

$$x,\ -\frac{x^3}{3!},\ \frac{x^5}{5!},\ -\frac{x^7}{7!},\ \ldots$$

To calculate one of the polynomial approximations to sin x, we choose
a positive integer n, and add up the first n consecutive members of this
sequence. The more consecutive members we add in, the better is the
approximation to sin x. This is usually represented by writing

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots \qquad \text{Equation (1)}$$

The expression on the right-hand side of this equation is called an infinite
series. The three dots are used to indicate that the expression does not
terminate.

The successive approximations:

$$x,$$
$$x - \frac{x^3}{3!},$$
$$x - \frac{x^3}{3!} + \frac{x^5}{5!},$$

etc.


are called the partial sums of the infinite series. Here the sequence of 
partial sums converges to a limit, namely sin x; this limit is called the 
(total) sum of the infinite series. 




It is very important to understand just what we mean by an infinite 
series. We give these important definitions formally: 


An infinite series is an expression of the form

$$a_1 + a_2 + a_3 + \cdots$$

The partial sums of the infinite series are the sums:

$$S_k = a_1 + a_2 + \cdots + a_k \quad (k = 1, 2, 3, \ldots)$$

If the sequence of partial sums,

$$S_1, S_2, S_3, \ldots$$

converges to a limit S, then we say that the series converges (or is
convergent) to the sum S, and we write

$$S = a_1 + a_2 + a_3 + \cdots$$

If the sequence of partial sums does not converge, then we say that the
series diverges (or is divergent): we cannot find a sum for it.

It is important to note the difference between the infinite series

$$a_1 + a_2 + a_3 + \cdots$$

and the infinite sequence

$$a_1, a_2, a_3, \ldots$$


An infinite series can be thought of as a way of specifying an infinite 
sequence of addition sums. 
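
The distinction between the sequence of terms and the sequence of partial sums can be seen in a short Python sketch. This is our own illustration, not part of the original text: `sine_terms` and `partial_sums` are hypothetical names, and the series used is the one for sin x discussed above, evaluated at x = 1.

```python
import math
from itertools import accumulate, count, islice

def sine_terms(x):
    """Yield the terms x, -x^3/3!, x^5/5!, ... of the series for sin x."""
    for j in count(0):
        m = 2 * j + 1
        yield (-1) ** j * x**m / math.factorial(m)

def partial_sums(terms):
    """Turn the sequence a_1, a_2, a_3, ... into S_1, S_2, S_3, ..."""
    return accumulate(terms)

terms = list(islice(sine_terms(1.0), 6))
sums = list(islice(partial_sums(sine_terms(1.0)), 6))

for k, (a_k, s_k) in enumerate(zip(terms, sums), start=1):
    print(k, a_k, s_k)
print("math.sin(1.0) =", math.sin(1.0))
```

The terms shrink towards 0, while the partial sums settle down towards sin 1: the series is a statement about the second sequence, not the first.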


Example 1 


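You may have met the formula for the sum of k terms of a geometric
progression,

$$a + ar + ar^2 + \cdots + ar^{k-1} = a\left(\frac{1 - r^k}{1 - r}\right) \quad (r \in R,\ r \ne 1).$$

This is the kth partial sum, $S_k$, of the infinite series

$$a + ar + ar^2 + \cdots$$

This series is called the infinite geometric series; the number r is called
the common ratio.
As an example, let us take a = 1 and r = ½; then we have:

$$1 + \frac{1}{2} + \frac{1}{4} + \cdots + \left(\frac{1}{2}\right)^{k-1} = \frac{1 - (\frac{1}{2})^k}{1 - \frac{1}{2}} = 2 - \left(\frac{1}{2}\right)^{k-1}.$$

Thus the sequence $S_1, S_2, S_3, \ldots$ is now

$$2 - 1,\ 2 - \tfrac{1}{2},\ 2 - \tfrac{1}{4},\ \ldots$$

which converges to 2, so we can write

$$1 + \frac{1}{2} + \frac{1}{4} + \cdots = 2.$$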
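
The formula for the kth partial sum can be checked against term-by-term addition. The sketch below is our own (the function names are hypothetical) and uses exact fractions, so that the comparison is not blurred by rounding.

```python
from fractions import Fraction

def geometric_partial_sum(a, r, k):
    """S_k = a + a r + ... + a r^(k-1), added up term by term."""
    total, term = 0, a
    for _ in range(k):
        total += term
        term *= r
    return total

def closed_form(a, r, k):
    """The formula a (1 - r^k) / (1 - r), valid for r != 1."""
    return a * (1 - r**k) / (1 - r)

a, r = Fraction(1), Fraction(1, 2)
for k in range(1, 7):
    s_k = geometric_partial_sum(a, r, k)
    # The gap 2 - S_k halves at every step, so S_k approaches 2.
    print(k, s_k, closed_form(a, r, k), 2 - s_k)
```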


Exercise 1 (2 minutes)

Obtain a formula for the kth partial sum of

$$1 - 1 + 1 - 1 + \cdots$$

Does the series converge or diverge?



Exercise 2 (3 minutes)

For what values of r can we define a sum for the infinite geometric series

$$1 + r + r^2 + r^3 + \cdots$$

and what is the formula for the sum in each case?

Exercise 3 (3 minutes)

For a given function f and given numbers x and a, if the corrections
$C_k(x)$ satisfy

$$\lim_{k \text{ large}} C_k(x) = 0,$$

what can we conclude about the infinite series:

$$f(a) + f'(a)(x - a) + \tfrac{1}{2} f''(a)(x - a)^2 + \cdots\ ?$$

What can we conclude if the corrections do not satisfy the above
condition?

Exercise 4 (5 minutes)

The "snowflake curve" is the limit of a sequence of polygons formed as
follows:

[Figure: the first polygon (a triangle), the second polygon and the third polygon.]

(Exercise 4 continues after Solution 3.)



Solution 1

$$S_k = \begin{cases} 0 & \text{if } k \text{ is even} \\ 1 & \text{if } k \text{ is odd} \end{cases}$$

and, since the sequence of partial sums 1, 0, 1, 0, ... diverges, the series
also diverges.

Solution 2

The formula given in Example 1 gives, for the kth partial sum,

$$S_k = 1 + r + r^2 + \cdots + r^{k-1} = \frac{1 - r^k}{1 - r} \quad (r \in R,\ r \ne 1).$$

We are interested in the behaviour of this expression for large k. This
depends on the value of r, and so there are several cases to consider.

(i) If |r| < 1, then $\lim_{k \text{ large}} r^k = 0$, and so

$$\lim_{k \text{ large}} \frac{1 - r^k}{1 - r} = \frac{1}{1 - r}.$$

In this case the series converges and its sum is $\dfrac{1}{1 - r}$.

(ii) If r = 1, then the formula for $S_k$ does not apply; we see that $S_k = k$,
and so the series diverges.

(iii) If |r| > 1, then $|r^k|$ increases with k, without any bound, and so the
series diverges.

(iv) If r = −1, we have the series

$$1 - 1 + 1 - 1 + \cdots$$

which, as we have seen in Solution 1, diverges.


Solution 3

The kth partial sum, $S_k$, of the infinite series is the (k − 1)th degree
Taylor approximation to f(x) about x = a. Thus we have

$$f(x) \approx S_k$$

with correction $C_{k-1}(x)$; or, in other words,

$$f(x) = S_k + C_{k-1}(x).$$

Thus, if we are given that $\lim_{k \text{ large}} C_k(x) = 0$, it follows that

$$f(x) = \lim_{k \text{ large}} S_k.$$

Consequently, the infinite series converges and its sum is f(x).
In the cases when $\lim_{k \text{ large}} C_k(x)$ is either non-existent or different
from zero, the conclusion is that the series either diverges or converges
to a limit different from f(x).




Exercise 4 (continued)

and so on. At each stage, every line segment in the old figure is changed to

[Figure: the replacement for each line segment.]

in the new figure in such a way as always to increase the enclosed area.

Calculate the limiting area enclosed, taking the area of the triangle (first
polygon) as 1 unit. What can be said about the limiting length of the
perimeter? (All angles are 60° or 120°.)

Exercise 5 (3 minutes)

Consider the function

$$f : x \longmapsto \frac{1}{1 + x^2} \quad (x \in R).$$

By considering the geometric series

$$1 - x^2 + x^4 - x^6 + \cdots$$

obtain a sequence of polynomial approximations for $\dfrac{1}{1 + x^2}$. (These
approximations are the Maclaurin polynomials for f.) Use the results
of Exercise 2 to find the set of values of x for which this sequence
converges to $\dfrac{1}{1 + x^2}$.

Exercise 6 (3 minutes)

In Unit 13, Integration II, section 13.2.6, we mentioned the possibility
of evaluating π using the formula

$$\frac{\pi}{4} = \int_0^1 \left(x \longmapsto \frac{1}{1 + x^2}\right).$$

Use the approximations obtained in the preceding exercise to get a
sequence of successive approximations to $\pi/4$. Assuming that this sequence
really does converge to the limit $\pi/4$, write down an infinite series whose
sum is $\pi/4$.


Solution 4

Let $a_1$ be the area of the triangle (= 1), and for each n = 2, 3, ... let $a_n$
be the area added at the (n − 1)th stage. This area is added in the form
of $b_n$ congruent triangles, each having one-third of the linear dimensions,
and therefore $\tfrac{1}{9}$ of the area, of those added at the previous stage. Thus,
the area of each triangle added at the (n − 1)th stage is $(\tfrac{1}{9})^{n-1}$, and so

$$a_n = b_n \times \left(\tfrac{1}{9}\right)^{n-1} \quad (n = 2, 3, \ldots).$$

Now $b_n$, the number of triangles added at the (n − 1)th stage, is equal
to the number of line segments created at the previous stage. At the
first stage, the number of line segments created is 3, and this is multiplied
by 4 each stage. Thus,

$$b_n = 3 \times 4^{n-2} \quad (n = 2, 3, \ldots).$$

This gives, when substituted in the previous equation,

$$a_n = 3 \times 4^{n-2} \times \left(\tfrac{1}{9}\right)^{n-1} = \tfrac{1}{3} \times \left(\tfrac{4}{9}\right)^{n-2} \quad (n = 2, 3, \ldots).$$

The total area at the (n − 1)th stage is

$$a_1 + a_2 + \cdots + a_n,$$

and so the limiting area can be expressed as the infinite series:

$$1 + \frac{1}{3} + \frac{1}{3}\left(\frac{4}{9}\right) + \frac{1}{3}\left(\frac{4}{9}\right)^2 + \cdots$$

which is a geometric series with common ratio $\tfrac{4}{9}$, excluding the first term,
which is an "odd man out". The sum is therefore, by the result of Exercise 2,

$$1 + \frac{a}{1 - r}, \quad \text{where } a = \frac{1}{3} \text{ and } r = \frac{4}{9},$$

$$= 1 + \frac{\frac{1}{3}}{\frac{5}{9}} = 1\tfrac{3}{5},$$

which is thus the limiting area.

The length of the perimeter of the snowflake does not fare so happily,
however. At each stage the number of line segments is multiplied by 4,
and the length of each line segment is divided by 3, so that the total length
of the perimeter is multiplied by $\tfrac{4}{3}$ at each stage. Thus the lengths of the
successive polygons are, in units of the perimeter of the first polygon,

$$1,\ \frac{4}{3},\ \left(\frac{4}{3}\right)^2,\ \left(\frac{4}{3}\right)^3,\ \ldots$$

This sequence diverges, and in fact the length increases beyond all bounds.
There is no "limiting length".
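
The two conclusions of this solution can be illustrated by carrying out the construction stage by stage. The Python sketch below is our own (the function name `snowflake` is hypothetical); it measures the area in units of the first triangle's area and the perimeter in units of the first triangle's perimeter, and uses exact fractions.

```python
from fractions import Fraction

def snowflake(stages):
    """Return (area, perimeter) of the polygon reached after `stages` stages."""
    area = Fraction(1)
    perimeter = Fraction(1)
    segments = 3                  # number of sides of the first polygon
    small_area = Fraction(1)      # area of one of the triangles added last
    for _ in range(stages):
        small_area /= 9                 # one-third the size, one-ninth the area
        area += segments * small_area   # one new triangle on every segment
        segments *= 4                   # every segment is replaced by four
        perimeter *= Fraction(4, 3)
    return area, perimeter

for stages in (0, 1, 2, 5, 10, 30):
    area, perimeter = snowflake(stages)
    print(stages, float(area), float(perimeter))
```

The printed areas creep up towards 1.6 (that is, $1\tfrac{3}{5}$), while the perimeters grow without bound.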


Solution 5

The series

$$1 - x^2 + x^4 - x^6 + \cdots$$

is a geometric series with common ratio $(-x^2)$, and therefore has the sum

$$\frac{1}{1 - (-x^2)} = \frac{1}{1 + x^2}$$

whenever $|-x^2| = x^2 < 1$; that is, whenever |x| < 1. The series diverges
if $|x| \ge 1$. The partial sums of this series are the polynomials

$$1,\quad 1 - x^2,\quad 1 - x^2 + x^4,\quad 1 - x^2 + x^4 - x^6,\quad \ldots$$

which accordingly form a convergent sequence of approximations to
$\dfrac{1}{1 + x^2}$ if |x| < 1, but not otherwise.




Solution 6

Since the sequence of successive approximations obtained in the preceding
exercise converges to $\dfrac{1}{1 + x^2}$ for all x in the interval of integration, with
the single exception of the end-point 1, it is reasonable to guess that by
successively substituting these polynomials for $\dfrac{1}{1 + x^2}$ in the integral we
shall obtain a sequence of successive approximations to $\pi/4$. This sequence
is:

$$\int_0^1 (x \longmapsto 1 - x^2) = \left[x - \tfrac{1}{3}x^3\right]_0^1 = 1 - \tfrac{1}{3}$$

$$\int_0^1 (x \longmapsto 1 - x^2 + x^4) = \left[x - \tfrac{1}{3}x^3 + \tfrac{1}{5}x^5\right]_0^1 = 1 - \tfrac{1}{3} + \tfrac{1}{5}$$

and so on: the nth approximation in the sequence is

$$1 - \frac{1}{3} + \frac{1}{5} - \cdots + (-1)^n \frac{1}{2n + 1}.$$

The corresponding infinite series is

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots$$

(This infinite series is mentioned in the radio programme.)
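
The behaviour of these approximations can be observed numerically. The Python sketch below is our own illustration (the name `quarter_pi_approximation` is hypothetical); it uses the library value of π only to display how far each approximation is from $\pi/4$.

```python
import math

def quarter_pi_approximation(n):
    """The sum 1 - 1/3 + 1/5 - ... + (-1)^n / (2n + 1)."""
    return sum((-1) ** j / (2 * j + 1) for j in range(n + 1))

for n in (1, 10, 100, 1000):
    s = quarter_pi_approximation(n)
    print(n, s, abs(math.pi / 4 - s))
```

The differences do shrink, but slowly: roughly ten times as many terms are needed for each further decimal place, so this series is a poor practical way of computing π.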


The infinite series notation provides a convenient way of summarizing
the type of result obtained in the second part of this text. For example,
by writing

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots \quad (x \in R)$$

we can concisely express a statement that would otherwise look something
like this:

"the correction $C_n(x)$ to the nth degree Maclaurin approximation

$$\sin x \approx x - \frac{x^3}{3!} + \cdots + (-1)^{(m-1)/2}\frac{x^m}{m!},$$

where m = n if n is odd, and m = n − 1 if n is even, satisfies

$$\lim_{n \text{ large}} C_n(x) = 0$$

for all $x \in R$".

Similarly, by writing

$$\frac{1}{1 - x} = 1 + x + x^2 + x^3 + \cdots \quad (x \in R,\ |x| < 1)$$

we paraphrase the statement:

"if |x| < 1, then the sum $S_n(x)$ of the geometric series

$$1 + x + x^2 + \cdots + x^{n-1}$$

satisfies

$$\lim_{n \text{ large}} S_n(x) = \frac{1}{1 - x}\text{"}.$$


To conclude this section, we summarize (for reference) a number of useful 
formulas of this kind. Some of them embody results already obtained in 
this unit, and some are new. 




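$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots \quad (x \in R);$$

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots \quad (x \in R);$$

$$\exp x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots \quad (x \in R);$$

$$\ln(1 + x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \cdots \quad (x \in R,\ |x| < 1);$$

$$(1 + x)^\alpha = 1 + \alpha x + \frac{\alpha(\alpha - 1)}{2!}x^2 + \frac{\alpha(\alpha - 1)(\alpha - 2)}{3!}x^3 + \cdots \quad (x \in R,\ |x| < 1),$$

where α is any real number. If α is a positive integer or zero, then all the
terms of the last series after the (α + 1)th are 0, so that the series reduces
to a polynomial of degree α, and for these values of α the formula holds
for all real x and not just for those satisfying |x| < 1.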
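
The remark about the last series can be tried out numerically. The Python sketch below is our own illustration (the name `binomial_series` is hypothetical): it adds up the first few terms of the series for $(1 + x)^\alpha$, forming each term from the one before.

```python
import math

def binomial_series(alpha, x, terms):
    """Sum of the first `terms` terms of 1 + alpha x + alpha(alpha-1)/2! x^2 + ..."""
    total, term = 0.0, 1.0
    for j in range(terms):
        total += term
        term *= (alpha - j) * x / (j + 1)   # next term from the previous one
    return total

# alpha a positive integer: the series terminates, and |x| < 1 is not needed.
print(binomial_series(3, 2.0, 10), (1 + 2.0) ** 3)

# alpha not an integer: the partial sums approach (1 + x)^alpha when |x| < 1.
print(binomial_series(0.5, 0.21, 60), math.sqrt(1.21))
```

With α = 3 every term after the fourth is 0 because the factor (α − j) vanishes, which is exactly the reduction to a polynomial described above.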


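Exercise 7 (2 minutes)

Transcribe the following formula into a statement about Taylor
polynomials:

$$\frac{1}{\sqrt{x}} = 1 - \frac{1}{2}(x - 1) + \frac{1 \times 3}{2 \times 4}(x - 1)^2 - \frac{1 \times 3 \times 5}{2 \times 4 \times 6}(x - 1)^3 + \cdots$$

$$\qquad + (-1)^n \frac{1 \times 3 \times 5 \times \cdots \times (2n - 1)}{2 \times 4 \times 6 \times \cdots \times (2n)}(x - 1)^n + \cdots$$

$$(x \in R^+ \text{ and } x < 2)$$
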


14.2.5 Appendix (Not Part of the Course) 

Demonstration of Taylor's Theorem 

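We wish to show that if B satisfies the inequality

$$|f^{(n+1)}(t)| \le B \quad (t \in [a, x]), \qquad \text{Inequality (1)}$$

then

$$|C_n(x)| \le \frac{1}{(n+1)!}\,B\,|x - a|^{n+1}, \qquad \text{Inequality (2)}$$

where

$$C_n(x) = f(x) - \left[f(a) + (x - a)f'(a) + \tfrac{1}{2}(x - a)^2 f''(a) + \cdots + \frac{1}{n!}(x - a)^n f^{(n)}(a)\right].$$

Inequality (1) is intended to imply that the (n + 1)th derivative of f exists
at all points in [a, x]. We define the function

$$C_n : t \longmapsto f(t) - \left[f(a) + (t - a)f'(a) + \tfrac{1}{2}(t - a)^2 f''(a) + \cdots + \frac{1}{n!}(t - a)^n f^{(n)}(a)\right] \quad (t \in [a, x]).$$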


This definition is consistent with the definition of $C_n(x)$ already given,
and it makes sense for all points in the domain of f, since we have stipulated
that f is n + 1 times differentiable at all points in this domain.

We shall estimate $C_n(x)$ by estimating its (n + 1)th derivative and then
integrating n + 1 times. Differentiating the function $C_n$, we obtain:

$$C_n'(t) = f'(t) - \left[f'(a) + (t - a)f''(a) + \cdots + \frac{1}{(n-1)!}(t - a)^{n-1} f^{(n)}(a)\right]$$

$$C_n''(t) = f''(t) - \left[f''(a) + \cdots + \frac{1}{(n-2)!}(t - a)^{n-2} f^{(n)}(a)\right]$$

$$\vdots$$

$$C_n^{(n)}(t) = f^{(n)}(t) - f^{(n)}(a)$$

$$C_n^{(n+1)}(t) = f^{(n+1)}(t)$$

where $t \in [a, x]$ in each case.
Combining the last equation with Inequality (1), we obtain the estimate

$$|C_n^{(n+1)}(t)| \le B \quad (t \in [a, x]). \qquad \text{Inequality (3)}$$


We can use this information to estimate $C_n(x)$ itself, by a succession of
n + 1 integrations.

For simplicity we confine detailed discussion to the case where x > a,
and to the upper bound on $C_n^{(n+1)}(t)$ implied by Inequality (3). The other
cases can be treated similarly. The upper bound on $C_n^{(n+1)}(t)$ given by
Inequality (3) is

$$C_n^{(n+1)}(t) \le B \quad (t \in [a, x]).$$

Integrating from a to s, where $s \in [a, x]$, gives (see diagram):

$$\int_a^s C_n^{(n+1)} \le \int_a^s (t \longmapsto B).$$


(The appendix continues after Solution 14.2.4.7.)



Solution 14.2.4.7 


The following is one of the many possible answers to the question:
If $x \in R^+$ and x < 2, then

$$\lim_{n \text{ large}} \left[\frac{1}{\sqrt{x}} - \left(1 - \frac{1}{2}(x - 1) + \cdots + (-1)^n \frac{1 \times 3 \times \cdots \times (2n - 1)}{2 \times 4 \times \cdots \times (2n)}(x - 1)^n\right)\right] = 0.$$

Your answer may look very different from this, but it should include the
following:

(i) the restriction that x must lie between 0 and 2;

(ii) the polynomial

$$1 - \frac{1}{2}(x - 1) + \cdots + (-1)^n \frac{1 \times 3 \times \cdots \times (2n - 1)}{2 \times 4 \times \cdots \times (2n)}(x - 1)^n$$

(this is the nth degree Taylor polynomial approximation to the
function $x \longmapsto \dfrac{1}{\sqrt{x}}$ about 1);

(iii) the statement that the difference between $\dfrac{1}{\sqrt{x}}$ and the above
polynomial (this difference is the correction to the nth degree Taylor
approximation or, with reversed sign, the error) approaches zero
as n increases.


Appendix 14.2.5 (continued)

[Diagram: graph of $t \longmapsto C_n^{(n+1)}(t)$.]

Evaluating the integrals with the help of the Fundamental Theorem of
Calculus (Unit 13, Integration II) gives (since $C_n^{(n+1)} = DC_n^{(n)}$, by
definition)

$$C_n^{(n)}(s) - C_n^{(n)}(a) \le (s - a)B.$$

But the equation which we obtained earlier for $C_n^{(n)}(t)$ shows us that
$C_n^{(n)}(a) = 0$, and since the last inequality holds for all s in [a, x], it follows
that:

$$C_n^{(n)}(t) \le (t - a)B \quad (t \in [a, x]).$$




Now we can repeat the procedure and reduce the order of the derivative
of $C_n$ one further. Integration from a to s, with $s \in [a, x]$, gives (see diagram):

$$\int_a^s C_n^{(n)} \le \int_a^s (t \longmapsto (t - a)B).$$

[Diagram: graph of $t \longmapsto C_n^{(n)}(t)$.]

That is,

$$C_n^{(n-1)}(s) - C_n^{(n-1)}(a) \le \tfrac{1}{2}(s - a)^2 B.$$

But, once again, we have $C_n^{(n-1)}(a) = 0$, and since the last inequality
holds for all s in [a, x], it follows that:

$$C_n^{(n-1)}(t) \le \tfrac{1}{2}(t - a)^2 B \quad (t \in [a, x]).$$

Repeating the procedure n − 1 more times we obtain:

$$C_n^{(n-2)}(t) \le \frac{1}{3!}(t - a)^3 B \quad (t \in [a, x])$$

$$\vdots$$

$$C_n'(t) \le \frac{1}{n!}(t - a)^n B \quad (t \in [a, x])$$

and finally

$$C_n(t) \le \frac{1}{(n+1)!}(t - a)^{n+1} B \quad (t \in [a, x]).$$

This is precisely the upper bound on $C_n(t)$ given by Taylor's Theorem
in the case x > a (since then we have $t \ge a$, so that t − a is the same as
|t − a|). Applying the same procedure for lower bounds, and for the
case where x < a, we can complete the demonstration of the form of
Taylor's Theorem given in the text.

The argument we have given is not, strictly speaking, a proof, since we
have relied on diagrams to demonstrate results of the form

$$g(t) \le h(t) \quad (t \in [a, x])$$

implies

$$\int_a^x g \le \int_a^x h.$$


It is not difficult to prove these results directly from the definition of an 
integral, but such proofs are really beyond the scope of the Foundation 
Course, since we have not put the properties of the real numbers on an 
axiomatic basis. 


Unit No.   Title of Text

    1      Functions
    2      Errors and Accuracy
    3      Operations and Morphisms
    4      Finite Differences
    5      NO TEXT
    6      Inequalities
    7      Sequences and Limits I
    8      Computing I
    9      Integration I
   10      NO TEXT
   11      Logic I - Boolean Algebra
   12      Differentiation I
   13      Integration II
   14      Sequences and Limits II
   15      Differentiation II
   16      Probability and Statistics I
   17      Logic II - Proof
   18      Probability and Statistics II
   19      Relations
   20      Computing II
   21      Probability and Statistics III
   22      Linear Algebra I
   23      Linear Algebra II
   24      Differential Equations I
   25      NO TEXT
   26      Linear Algebra III
   27      Complex Numbers I
   28      Linear Algebra IV
   29      Complex Numbers II
   30      Groups I
   31      Differential Equations II
   32      NO TEXT
   33      Groups II
   34      Number Systems
   35      Topology
   36      Mathematical Structures


[Overlays 24 to 27: graphs of successive Maclaurin polynomials.
OVERLAY 24: $x \longmapsto 1 + x$ (n = 1)
OVERLAY 25: $x \longmapsto 1 + x + x^2$ (n = 2)
OVERLAY 26: $x \longmapsto 1 + x + x^2 + x^3 + x^4$ (n = 4)
OVERLAY 27: $x \longmapsto 1 + x + x^2 + \cdots + x^7$ (n = 7)]


the successive Maclaurin approximations to $(1 - x)^{-1}$; these are shown
in the following diagram* and its overlays:

[Diagram: graph against axes x and y, with the dotted line x = 1; OVERLAY 24 adds the graph of $x \longmapsto 1 + x$ (n = 1).]

The graphs show that the nature of the approximation is not the same
as in the case of sin x: for sin x the interval over which the approximation
is good gets wider as the degree of the polynomial gets higher; but for
$(1 - x)^{-1}$ the interval of good approximation is always contained within
the interval [−1, 1].


These results show that the Taylor (Maclaurin) approximation method 
is quite temperamental: sometimes it is very effective, but on other 
occasions the approximations it produces are wide of the mark. The 
method is a very powerful one, but to be able to use it without getting 
into trouble one needs either very sound intuition or some theorems 
that will specify the situations in which the method is successful. In the 
next section of the text we shall leave the exploratory approach we have 
been using and look at the theory of the Taylor approximation method 
from a rigorous point of view. 


* We have drawn only the part of the graph for which 1/(1 − x) > 0; the dotted line is the
line specified by x = 1.
The overlays are in the wallet on the inside of the back cover of this text.


